Article

PRIDB: a Protein-RNA Interface Database.

Bioinformatics and Computational Biology Program, Iowa State University, Department of Genetics, Development and Cell Biology, Iowa State University, Department of Computer Science, Iowa State University, Ames, IA 50011, Department of Biology, Elon University, Elon, NC 27244 and Computational Systems Biology Summer Institute, Iowa State University, Ames, IA 50011, USA.
Nucleic Acids Research (Impact Factor: 8.28). 11/2010; 39(Database issue):D277-82. DOI:10.1093/nar/gkq1108
Source: PubMed

ABSTRACT The Protein-RNA Interface Database (PRIDB) is a comprehensive database of protein-RNA interfaces extracted from complexes in the Protein Data Bank (PDB). It is designed to facilitate detailed analyses of individual protein-RNA complexes and their interfaces, in addition to automated generation of user-defined data sets of protein-RNA interfaces for statistical analyses and machine learning applications. For any chosen PDB complex or list of complexes, PRIDB rapidly displays interfacial amino acids and ribonucleotides within the primary sequences of the interacting protein and RNA chains. PRIDB also identifies ProSite motifs in protein chains and FR3D motifs in RNA chains and provides links to these external databases, as well as to structure files in the PDB. An integrated JMol applet is provided for visualization of interacting atoms and residues in the context of the 3D complex structures. The current version of PRIDB contains structural information regarding 926 protein-RNA complexes available in the PDB (as of 10 October 2010). Atomic- and residue-level contact information for the entire data set can be downloaded in a simple machine-readable format. Also, several non-redundant benchmark data sets of protein-RNA complexes are provided. The PRIDB database is freely available online at http://bindr.gdcb.iastate.edu/PRIDB.
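As a rough illustration of what residue-level contact information involves, the sketch below marks an amino acid and a ribonucleotide as interfacial when any pair of their atoms lies within a distance cutoff. This is only a minimal sketch under stated assumptions: the 5 Å cutoff, the tuple-based atom records, and the function name `interface_residues` are hypothetical choices made for this example and are not taken from PRIDB, whose contact criterion is not stated in the abstract.

```python
from math import dist

# Hypothetical atom record: (chain_id, residue_number, residue_name, (x, y, z)).
# This layout is an assumption for illustration, not PRIDB's download format.
def interface_residues(protein_atoms, rna_atoms, cutoff=5.0):
    """Return (protein_residues, rna_residues) that have at least one
    atom pair within `cutoff` angstroms (cutoff value is an assumption)."""
    protein_hits, rna_hits = set(), set()
    for p_chain, p_num, p_name, p_xyz in protein_atoms:
        for r_chain, r_num, r_name, r_xyz in rna_atoms:
            if dist(p_xyz, r_xyz) <= cutoff:
                protein_hits.add((p_chain, p_num, p_name))
                rna_hits.add((r_chain, r_num, r_name))
    return sorted(protein_hits), sorted(rna_hits)

# Toy coordinates invented for illustration, not taken from any PDB entry.
protein_atoms = [
    ("A", 10, "ARG", (0.0, 0.0, 0.0)),
    ("A", 11, "GLY", (20.0, 0.0, 0.0)),
]
rna_atoms = [
    ("B", 1, "G", (3.0, 0.0, 0.0)),
    ("B", 2, "U", (30.0, 0.0, 0.0)),
]

protein_iface, rna_iface = interface_residues(protein_atoms, rna_atoms)
print(protein_iface)
print(rna_iface)
```

The all-pairs loop is quadratic in the number of atoms, which is acceptable for a toy example; a spatial index would be a more realistic choice for whole complexes.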

  • ABSTRACT: Understanding the details of protein-RNA interactions is important for revealing the functions of both the RNAs and the proteins. In these interactions, the secondary structures of the RNAs play an important role. Because RNA secondary structures in protein-RNA complexes are variable, considering the ensemble of RNA secondary structures is a useful approach. In particular, recent studies have supported the idea that, in the analysis of RNA secondary structures, the base-pairing probabilities of RNAs (i.e., the probabilities of forming a base pair in the ensemble of RNA secondary structures) provide richer and more robust information about the structures than a single RNA secondary structure, for example, the minimum free energy (MFE) structure or a snapshot of structures in the PDB. However, there has been no investigation of the base-pairing probabilities in protein-RNA interactions. In this study, we analyzed base-pairing probabilities of RNA molecules involved in known protein-RNA complexes in the PDB. Our analysis suggests that, in the tertiary structures, the base-pairing probabilities (which are computed using only sequence information) for unpaired nucleotides with intermolecular hydrogen bonds to amino acids were significantly lower than those for unpaired nucleotides without hydrogen bonds. On the other hand, no difference was found between the base-pairing probabilities for paired nucleotides with and without intermolecular hydrogen bonds. These findings were consistently supported by three probabilistic models that provide the ensemble of RNA secondary structures, including the McCaskill model based on Turner's free energy of secondary structures. Contact: iwakiri@cb.k.u-tokyo.ac.jp, mhamada@cb.k.u-tokyo.ac.jp.
    Bioinformatics 08/2013; · 5.47 Impact Factor
  • ABSTRACT: Though most transcripts are long non-coding RNAs (lncRNAs), little is known about their functions. lncRNAs usually function through interactions with proteins, so identifying the binding proteins of lncRNAs is important for understanding the molecular mechanisms underlying their functions. Only a few approaches are available for predicting interactions between lncRNAs and proteins. In this study, we introduce a new method, lncPro. By encoding RNA and protein sequences into numeric vectors, we used matrix multiplication to score each RNA-protein pair. This score can be used to measure the interaction between an RNA-protein pair. The method effectively discriminates interacting from non-interacting RNA-protein pairs and predicts RNA-protein interactions within a given complex. Applying this method to all human proteins, we found that the long non-coding RNAs we collected tend to interact with nuclear proteins and RNA-binding proteins. Compared with existing approaches, our method shortens the time for training the matrix and obtains optimal results based on the model being used. The ability to predict associations between lncRNAs and proteins has also been enhanced. Our method provides an idea of how to integrate different information into the prediction process.
    BMC Genomics 09/2013; 14(1):651. · 4.40 Impact Factor
  • ABSTRACT: In this work, we have analyzed the influence of cation-π interactions on the stability of 59 high-resolution protein-RNA complex crystal structures. The total numbers of Lys and Arg residues in the dataset are similar, as are the numbers of their interactions. On the other hand, the aromatic rings of purines exhibit more cation-π interactions than those of pyrimidines. 35% of the total interactions in the dataset are involved in the formation of multiple cation-π interactions. The multiple cation-π interactions are more conserved than the single interactions. Analysis of the geometry of the cation-π interactions revealed that the average distance (d) falls into distinct ranges for the multiple (4.28 Å) and single (5.50 Å) cation-π interactions. The G-Arg pair has the strongest interaction energy, -3.68 kcal mol^-1, among all possible pairs of amino acids and bases. Further, we found that the cation-π interactions involving the five-membered rings of A and G are stronger than those involving the atoms in the six-membered rings. 8.7% of stabilizing residues are involved in building cation-π interactions with the nucleic bases. Three types of structural motifs are significantly over-represented in protein-RNA interfaces: beta-turn-ir, niche-4r and st-staple. Tetraloops and kink-turns are the most abundant RNA motifs in protein-RNA interfaces. Amino acids in the protein-RNA interfaces are located in helices, sheets and coils. Arg and Lys residues involved in cation-π interactions prefer to be on the solvent-exposed surface. The results from this study might be used for structure-based prediction and as scaffolds for future protein-RNA complex design.
    Computational biology and chemistry 08/2013; 47C:105-112. · 1.37 Impact Factor
