HinP1I recognizes and cleaves the palindromic tetranucleotide sequence G↓CGC in DNA. We report three structures of HinP1I–DNA complexes: in the presence of Ca2+ (pre-reactive complex), in the absence of metal ion (binary complex) and in the presence of Mg2+ (post-reactive complex). HinP1I forms a back-to-back dimer with two active sites and two DNA duplexes bound on the outer surfaces of the dimer facing away from each other. The 10 bp DNA duplexes undergo protein-induced distortions exhibiting features of A-, B- and Z-conformations: bending on one side (by intercalation of a phenylalanine side chain into the major groove), base flipping on the other side of the recognition site (by expanding the step rise distance of the local base pair to Z-form) and a local A-form conformation between the two central C:G base pairs of the recognition site (by binding of the N-terminal helix in the minor groove). In the pre- and post-reactive complexes, two metals (Ca2+ or Mg2+) are found in the active site. The enzyme appears to cleave DNA sequentially, hydrolyzing first one DNA strand, as seen in the post-reactive complex in the crystalline state, and then the other, as supported by the observation that, in solution, a nicked DNA intermediate accumulates before linearization.
The Nervous System Disease NcRNAome Atlas (NSDNA) (http://www.bio-bigdata.net/nsdna/) is a manually curated database that provides comprehensive experimentally supported associations about nervous system diseases (NSDs) and noncoding RNAs (ncRNAs). NSDs represent a common group of disorders, some of which are characterized by high morbidity and disabilities. The pathogenesis of NSDs at the molecular level remains poorly understood. ncRNAs are a large family of functionally important RNA molecules. Increasing evidence shows that diverse ncRNAs play a critical role in various NSDs. Mining and summarizing NSD-ncRNA association data can help researchers discover useful information. Hence, we developed an NSDNA database that documents 24 713 associations between 142 NSDs and 8593 ncRNAs in 11 species, curated from more than 1300 articles. This database provides a user-friendly interface for browsing and searching and allows for data downloading flexibility. In addition, NSDNA offers a submission page for researchers to submit novel NSD-ncRNA associations. It represents an extremely useful and valuable resource for researchers who seek to understand the functions and molecular mechanisms of ncRNA involved in NSDs.
DNA methylation is an important epigenetic modification that has essential roles in cellular processes including gene regulation, development and disease and is widely dysregulated in most types of cancer. Recent advances in sequencing technology have enabled the measurement of DNA methylation at single nucleotide resolution through methods such as whole-genome bisulfite sequencing and reduced representation bisulfite sequencing. In DNA methylation studies, a key task is to identify differences under distinct biological contexts, for example, between tumor and normal tissue. A challenge in sequencing studies is that the number of biological replicates is often limited by the costs of sequencing. The small number of replicates leads to unstable variance estimation, which can reduce accuracy to detect differentially methylated loci (DML). Here we propose a novel statistical method to detect DML when comparing two treatment groups. The sequencing counts are described by a lognormal-beta-binomial hierarchical model, which provides a basis for information sharing across different CpG sites. A Wald test is developed for hypothesis testing at each CpG site. Simulation results show that the proposed method yields improved DML detection compared to existing methods, particularly when the number of replicates is low. The proposed method is implemented in the Bioconductor package DSS.
Human 2'-5' oligoadenylate synthetase-1 (OAS1) is central in innate immune system detection of cytoplasmic double-stranded RNA (dsRNA) and promotion of host antiviral responses. However, the molecular signatures that promote OAS1 activation are currently poorly defined. We show that the 3'-end polyuridine sequence of viral and cellular RNA polymerase III non-coding transcripts is critical for their optimal activation of OAS1. Potentiation of OAS1 activity was also observed with a model dsRNA duplex containing an OAS1 activation consensus sequence. We determined that the effect is attributable to a single appended 3'-end residue, is dependent upon its single-stranded nature with strong preference for pyrimidine residues and is mediated by a highly conserved OAS1 residue adjacent to the dsRNA binding surface. These findings represent discovery of a novel signature for OAS1 activation, the 3'-single-stranded pyrimidine (3'-ssPy) motif, with potential functional implications for OAS1 activity in its antiviral and other anti-proliferative roles.
MspJI belongs to a family of restriction enzymes that cleave DNA containing 5-methylcytosine (5mC) or 5-hydroxymethylcytosine (5hmC). MspJI is specific for the sequence 5(h)mC-N-N-G or A and cleaves with some variability 9/13 nucleotides downstream. Earlier, we reported the crystal structure of MspJI without DNA and proposed how it might recognize this sequence and catalyze cleavage. Here we report its co-crystal structure with a 27-base pair oligonucleotide containing 5mC. This structure confirms that MspJI acts as a homotetramer and that the modified cytosine is flipped from the DNA helix into an SRA-like-binding pocket. We expected the structure to reveal two DNA molecules bound specifically to the tetramer and engaged with the enzyme's two DNA-cleavage sites. A coincidence of crystal packing precluded this organization, however. We found that each DNA molecule interacted with two adjacent tetramers, binding one specifically and the other non-specifically. The latter interaction, which prevented cleavage-site engagement, also involved base flipping and might represent the sequence-interrogation phase that precedes specific recognition. MspJI is unusual in that DNA molecules are recognized and cleaved by different subunits. Such interchange of function might explain how other complex multimeric restriction enzymes act.
AbaSI, a member of the PvuRts1I-family of modification-dependent restriction endonucleases, cleaves deoxyribonucleic acid (DNA) containing 5-hydroxymethylctosine (5hmC) and glucosylated 5hmC (g5hmC), but not DNA containing unmodified cytosine. AbaSI has been used as a tool for mapping the genomic locations of 5hmC, an important epigenetic modification in the DNA of higher organisms. Here we report the crystal structures of AbaSI in the presence and absence of DNA. These structures provide considerable, although incomplete, insight into how this enzyme acts. AbaSI appears to be mainly a homodimer in solution, but interacts with DNA in our structures as a homotetramer. Each AbaSI subunit comprises an N-terminal, Vsr-like, cleavage domain containing a single catalytic site, and a C-terminal, SRA-like, 5hmC-binding domain. Two N-terminal helices mediate most of the homodimer interface. Dimerization brings together the two catalytic sites required for double-strand cleavage, and separates the 5hmC binding-domains by ~70 Å, consistent with the known activity of AbaSI which cleaves DNA optimally between symmetrically modified cytosines ~22 bp apart. The eukaryotic SET and RING-associated (SRA) domains bind to DNA containing 5-methylcytosine (5mC) in the hemi-methylated CpG sequence. They make contacts in both the major and minor DNA grooves, and flip the modified cytosine out of the helix into a conserved binding pocket. In contrast, the SRA-like domain of AbaSI, which has no sequence specificity, contacts only the minor DNA groove, and in our current structures the 5hmC remains intra-helical. A conserved, binding pocket is nevertheless present in this domain, suitable for accommodating 5hmC and g5hmC. We consider it likely, therefore, that base-flipping is part of the recognition and cleavage mechanism of AbaSI, but that our structures represent an earlier, pre-flipped stage, prior to actual recognition.
Quantitative reverse transcriptase polymerase chain reaction (qRT–PCR) is a commonly employed gene expression quantification technique. This requires the development of appropriately targeted oligonucleotide primers, which necessitates the identification of ideal amplicons, development of optimized oligonucleotide sequences under most favorable pre-determined reaction conditions, and management of the resultant target-oligonucleotide pair information for each gene to be studied. The Primer3 utility exists for development of oligonucleotide primers and fills that role effectively. However, the manual process of identifying target sites and individually generating primers is inefficient and prone to user-introduced error, especially when a large number of genes are to be examined. We have developed MultiPriDe (Multiple Primer Design), a Perl utility that accepts batch lists of Gene database identifiers, collects available intron and exon position data critical to qRT–PCR primer development, and supplies these sites as identified targets for the Primer3 utility. This automated ‘gene to primer’ procedure is coupled with a set of optimized hybridization conditions used by the Primer3 utility to maximize successful primer design. MultiPriDe and assembled repeat libraries are available upon request. Please direct requests to aziesel@emory.edu.
A complex of two related mammalian proteins, SFPQ and NONO, promotes DNA double-strand break repair via the canonical nonhomologous end joining (c-NHEJ) pathway. However, its mechanism of action is not fully understood. Here we describe an improved SFPQ•NONO-dependent in vitro end joining assay. We use this system to demonstrate that the SFPQ•NONO complex substitutes in vitro for the core c-NHEJ factor, XLF. Results are consistent with a model where SFPQ•NONO promotes sequence-independent pairing of DNA substrates, albeit in a way that differs in detail from XLF. Although SFPQ•NONO and XLF function redundantly in vitro, shRNA-mediated knockdown experiments indicate that NONO and XLF are both required for efficient end joining and radioresistance in cell-based assays. In addition, knockdown of NONO sensitizes cells to the interstrand crosslinking agent, cisplatin, whereas knockdown of XLF does not, and indeed suppresses the effect of NONO deficiency. These findings suggest that each protein has one or more unique activities, in addition to the DNA pairing revealed in vitro, that contribute to DNA repair in the more complex cellular milieu. The SFPQ•NONO complex contains an RNA binding domain, and prior work has demonstrated diverse roles in RNA metabolism. It is thus plausible that the additional repair function of NONO, revealed in cell-based assays, could involve RNA interaction.
Transcription factor Krüppel-like factor 4 (Klf4), one of the factors directing cellular reprogramming, recognizes the CpG dinucleotide (whether methylated or unmodified) within a specific G/C-rich sequence. The binding affinity of the mouse Klf4 DNA-binding domain for methylated DNA is only slightly stronger than that for an unmodified oligonucleotide. The structure of the C-terminal three Krüppel-like zinc fingers (ZnFs) of mouse Klf4, in complex with fully methylated DNA, was determined at 1.85 Å resolution. An arginine and a glutamate interact with the methyl group. By comparison with two other recently characterized structures of ZnF protein complexes with methylated DNA, we propose a common principle of recognition of methylated CpG by C2H2 ZnF proteins, which involves a spatially conserved Arg–Glu pair.
HinP1I, a type II restriction endonuclease, recognizes and cleaves a palindromic tetranucleotide sequence (G↓CGC) in double-stranded DNA, producing 2 nt 5′ overhanging ends. Here, we report the structure of HinP1I crystallized as one protein monomer in the crystallographic asymmetric unit. HinP1I displays an elongated shape, with a conserved catalytic core domain containing an active-site motif of SDX18QXK and a putative DNA-binding domain. Without significant sequence homology, HinP1I displays striking structural similarity to MspI, an endonuclease that cleaves a similar palindromic DNA sequence (C↓CGG) and binds to that sequence crystallographically as a monomer. Almost all the structural elements of MspI can be matched in HinP1I, including both the DNA recognition and catalytic elements. Examining the protein–protein interactions in the crystal lattice, HinP1I could be dimerized through two helices located on the opposite side of the protein to the active site, generating a molecule with two active sites and two DNA-binding surfaces opposite one another on the outer surfaces of the dimer. A possible functional link between this unusual dimerization mode and the tetrameric restriction enzymes is discussed.