Conformation Dependence of Backbone Geometry in Proteins

Department of Biochemistry and Biophysics, Oregon State University, Corvallis, OR 97331, USA.
Structure (Impact Factor: 6.79). 10/2009; 17(10):1316-25. DOI: 10.1016/j.str.2009.08.012
Source: PubMed

ABSTRACT Protein structure determination and predictive modeling have long been guided by the paradigm that the peptide backbone has a single, context-independent ideal geometry. Both quantum-mechanics calculations and empirical analyses have shown this is an incorrect simplification in that backbone covalent geometry actually varies systematically as a function of the phi and Psi backbone dihedral angles. Here, we use a nonredundant set of ultrahigh-resolution protein structures to define these conformation-dependent variations. The trends have a rational, structural basis that can be explained by avoidance of atomic clashes or optimization of favorable electrostatic interactions. To facilitate adoption of this paradigm, we have created a conformation-dependent library of covalent bond lengths and bond angles and shown that it has improved accuracy over existing methods without any additional variables to optimize. Protein structures derived from crystallographic refinement and predictive modeling both stand to benefit from incorporation of the paradigm.

  • [Show abstract] [Hide abstract]
    ABSTRACT: Relative amino acid residue solvent accessibility values allow the quantitative comparison of atomic solvent-accessible surface areas in different residue types and physical environments in proteins and in protein structural alignments. Geometry-optimised tri-peptide structures in extended solvent-exposed reference conformations have been obtained for 43 amino acid residue types at a high level of quantum chemical theory. Significant increases in side-chain solvent accessibility, offset by reductions in mainchain atom solvent exposure, were observed for standard residue types in partially geometry-optimised structures when compared to non-minimised models built from identical sets of proper dihedral angles abstracted from the literature. Optimisation of proper dihedral angles led most notably to marked increases of up to 54% in proline main-chain atom solvent accessibility compared to literature values. Similar effects were observed for fully-optimised tri-peptides in implicit solvent. The relief of internal strain energy was associated with systematic variation in N, Ca and Cb atom solvent accessibility across all standard residue types. The results underline the importance of optimisation of ‘hard’ degrees of freedom (bond lengths and valence bond angles) and improper dihedral angle values from force field or other context-independent reference values, and impact on the use of standardised fixed internal co-ordinate geometry in sampling approaches to the determination of absolute values of protein amino acid residue solvent accessibility. Quantum chemical methods provide a useful and accurate alternative to molecular mechanics methods to perform energy minimisation of peptides containing non-standard (chemically modified) amino acid residues frequently present in experimental protein structure data sets, for which force field parameters may not be available. Reference tri-peptide atomic co-ordinate sets including hydrogen atoms are made freely available.
    Computational Biology and Chemistry 02/2015; 54:33-43. DOI:10.1016/j.compbiolchem.2014.11.007 · 1.60 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Ideal values of bond angles and lengths used as external restraints are crucial for the successful refinement of protein crystal structures at all but the highest of resolutions. The restraints in common usage today have been designed based on the assumption that each type of bond or angle has a single ideal value independent of context. However, recent work has shown that the ideal values are, in fact, sensitive to local conformation, and as a first step toward using such information to build more accurate models, ultra-high resolution protein crystal structures have been used to derive a conformation-dependent library (CDL) of restraints for the protein backbone (Berkholz et al. 2009. Structure. 17, 1316). Here, we report the introduction of this CDL into the Phenix package and the results of test refinements of thousands of structures across a wide range of resolutions. These tests show that use of the conformation dependent library yields models that have substantially better agreement with ideal main-chain bond angles and lengths and, on average, a slightly enhanced fit to the X-ray data. No disadvantages of using the backbone CDL are apparent. In Phenix usage of the CDL can be selected by simply specifying the cdl=True option. This successful implementation paves the way for further aspects of the context-dependence of ideal geometry to be characterized and applied to improve experimental and predictive modelling accuracy.This article is protected by copyright. All rights reserved.
    FEBS Journal 05/2014; 281(18). DOI:10.1111/febs.12860 · 3.99 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Proteins are long-chained biomolecules with distinctive functions, that take a major role in all living systems. The function is defined by the protein structure, which in turn is determined via a complicated mechanism based on the amino acid sequence. The exact procedure is not fully understood. However, knowing the structure is important for the pharmaceutical industry as well as bioengineering and nanotechnology. Unfortunately, determining it experimentally is slow and expensive. There is also much interest in being able to adapt the sequence to make stable industrial enzymes or to form molecules with specialised shapes, e.g. for biosensors. Predicting a structure computationally from the sequence is a classic problem in theoretical biochemistry, that has not been solved yet. In this work the emphasis lies in methodological improvements, that avoid common chemical preconceptions. A general method for building numerical model s is developed and analysed here. It is based on a statistical correlation scheme of sequence and structure using ideas from self-consistent mean field (SCMF) optimisation. The procedure is successfully applied to the structure prediction and sequence design problems without using a Boltzmann formalism. The statistical model is based on a mixture distribution of bivariate Gaussian and 20-way Bernoulli distributions. The Gaussian distributions model the continuous variables of the structure (dihedral angles) and the Bernoulli distributions capture the sequence propensities. Instead of treating the protein as a statistical unit, easier to handle fragments are used. Several approaches to recombine them are discussed. But the fragments form local statistical units that do not necessarily agree with each other. A method suited to deal with such inconsistencies is SCMF optimisation. Mean field or SCMF methods optimise a system by treating all solution states at the same time. In existing approaches, an energy potential was introduced that reflects the pairwise mean interaction between subsystems. The state weights of the subsystems were converted alternately into energies and probabilities by applying the Boltzmann relation repeatedly until a self-consistent state for the whole system is reached. With the approach presented here it is possible to optimise the state probabilities directly. The Boltzmann distribution is essentially an unnecessary assumption. Therefore, the method is also applicable to systems with an unknown ensemble.
    05/2012, Degree: Doctor rerum naturalium, Supervisor: Andrew E. Torda

Full-text (2 Sources)

Available from
May 29, 2014