Assessing Side-Chain Perturbations of the Protein Backbone: A Knowledge-Based Classification of Residue Ramachandran Space

Department of Statistics, Texas A&M University, College Station, TX 77843, USA.
Journal of Molecular Biology (Impact Factor: 3.96). 06/2008; 378(3):749-58. DOI: 10.1016/j.jmb.2008.02.043
Source: PubMed

ABSTRACT Grouping the 20 residues is a classic strategy to discover ordered patterns and insights about the fundamental nature of proteins, their structure, and how they fold. Usually, this categorization is based on the biophysical and/or structural properties of a residue's side-chain group. We extend this approach to understand the effects of side chains on backbone conformation and to perform a knowledge-based classification of amino acids by comparing their backbone phi, psi distributions in different types of secondary structure. At this finer, more specific resolution, torsion angle data are often sparse and discontinuous (especially for nonhelical classes) even though a comprehensive set of protein structures is used. To ensure the precision of Ramachandran plot comparisons, we applied a rigorous Bayesian density estimation method that produces continuous estimates of the backbone phi, psi distributions. Based on this statistical modeling, a robust hierarchical clustering was performed using a divergence score to measure the similarity between plots. There were seven general groups based on the clusters from the complete Ramachandran data: nonpolar/beta-branched (Ile and Val), AsX (Asn and Asp), long (Met, Gln, Arg, Glu, Lys, and Leu), aromatic (Phe, Tyr, His, and Cys), small (Ala and Ser), bulky (Thr and Trp), and, lastly, the singletons of Gly and Pro. At the level of secondary structure (helix, sheet, turn, and coil), these groups remain somewhat consistent, although there are a few significant variations. Besides the expected uniqueness of the Gly and Pro distributions, the nonpolar/beta-branched and AsX clusters were very consistent across all types of secondary structure. Effectively, this consistency across the secondary structure classes implies that side-chain steric effects strongly influence a residue's backbone torsion angle conformation. These results help to explain the plasticity of amino acid substitutions on protein structure and should help in protein design and structure evaluation.


Available from: Marina Vannucci, Jul 26, 2014
  • [Show abstract] [Hide abstract]
    ABSTRACT: Cholera toxin (CT) is an AB5 protein complex secreted by the pathogen Vibrio cholera, which is responsible for cholera infection. N-acetylneuraminic acid (NeuNAc) is a derivative of neuraminic acid with nine-carbon backbone. NeuNAc is distributed on the cell surface mainly located in the terminal components of glycoconjugates, and also plays an important role in cell-cell interaction. In our current study, molecular docking and molecular dynamic (MD) simulations were implemented to identify the potent NeuNAc analogs with high-inhibitory activity against CT protein. Thirty-four NeuNAc analogs, modified in different positions C-1/C-2/C-4/C-5/C-7/C-8/C-9, were modeled and docked against the active site of CT protein. Among the 34 NeuNAc analogs, the analog Neu5Gc shows the least extra precision glide score of -9.52 and glide energy of -44.71 kcal/mol. NeuNAc analogs block the CT active site residues HIS:13, ASN:90, LYS:91, GLN:56, GLN:61, and TRP:88 through intermolecular hydrogen bonding. The MD simulation for CT-Neu5Gc docking complex was performed using Desmond. MD simulation of CT-Neu5Gc complex reveals the stable nature of docking interaction.
    Journal of biomolecular Structure & Dynamics 07/2014; 33(5):1-14. DOI:10.1080/07391102.2014.931825 · 2.98 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: On the occasion of their fiftieth birthday, it is opportune to review the first half century of Ramachandran plots. In the present review, some of the most relevant aspects of this fifty-year history are summarized, from the original ideas of Gopalasamudram Narayana Ramachandran to subsequent revisions and to applications in structural biology. This will not be a guided walk through five decades of Ramachandran plots, but a commented summary of the lines along which the original ideas evolved and continue to develop, and of their applications.
    Acta Crystallographica Section D Biological Crystallography 08/2013; 69(Pt 8):1333-41. DOI:10.1107/S090744491301158X · 7.23 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The most informative probability distribution functions (PDFs) describing the Ramachandran phi-psi dihedral angle pair, a fundamental descriptor of backbone conformation of protein molecules, are derived from high-resolution X-ray crystal structures using an information-theoretic approach. The Information Maximization Device (IMD) is established, based on fundamental information-theoretic concepts, and then applied specifically to derive highly resolved phi-psi maps for all 20 single amino acid and all 8000 triplet sequences at an optimal resolution determined by the volume of current data. The paper shows that utilizing the latent information contained in all viable high-resolution crystal structures found in the Protein Data Bank (PDB), totaling more than 77,000 chains, permits the derivation of a large number of optimized sequence-dependent PDFs. This work demonstrates the effectiveness of the IMD and the superiority of the resulting PDFs by extensive fold recognition experiments and rigorous comparisons with previously published triplet PDFs. Because it automatically optimizes PDFs, IMD results in improved performance of knowledge-based potentials, which rely on such PDFs. Furthermore, it provides an easy computational recipe for empirically deriving other kinds of sequence-dependent structural PDFs with greater detail and precision. The high-resolution phi-psi maps derived in this work are available for download.
    PLoS ONE 06/2014; 9(6):e94334. DOI:10.1371/journal.pone.0094334 · 3.53 Impact Factor