Quantitative residue-level structure-evolution relationships in the yeast membrane proteome.

Bioinformatics Program, Department of Chemistry, Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA.
Genome Biology and Evolution (Impact Factor: 4.53). 03/2013; DOI: 10.1093/gbe/evt039
Source: PubMed

ABSTRACT Membrane proteins exist in distinctly different environments from soluble proteins, resulting in differences between their respective biophysical and evolutionary properties. In comparison with soluble proteins, relatively little is known about how the unique biophysical properties of membrane proteins affect their evolutionary properties at the residue level. In particular, transmembrane regions of membrane proteins tend to be more conserved than regions outside of the membrane (extramembrane regions), but the mechanisms underlying this phenomenon are not well-understood. Here, we combine homology-based high-resolution three-dimensional protein models with rigorous evolutionary rate calculations to quantitatively assess residue-level structure-evolution relationships in the yeast membrane proteome. We find that residue evolutionary rate increases linearly with decreasing residue burial, regardless of the hydrophobic or hydrophilic nature of the solvent environment. This finding supports a direct relationship between a residue's selective constraint and the extent of its packing interactions with neighboring residues, independent of hydrophobic effects. Most importantly, for a fixed degree of burial, residues from transmembrane regions tend to evolve more slowly than residues from extramembrane regions. We attribute this difference to the increased importance of packing constraints and the decreased importance of hydrophobic effects in transmembrane regions. This additional selective constraint on transmembrane residues plays a dominant role in explaining why transmembrane regions evolve more slowly than extramembrane regions. In addition to revealing the universality of the linear relationship between residue burial and selective constraint across solvent environments, our work highlights the distinct residue-level evolutionary consequences imposed by the unique biophysical properties of the membrane environment.

  • [Show abstract] [Hide abstract]
    ABSTRACT: Based on the rates of amino acid substitution for 60 mammalian genes of 50 codons or more, it is shown that the rate of amino acid substitution of a protein is correlated with its amino acid composition. In particular, the content of glycine residues is negatively correlated with the rate of amino acid substitution, and this content alone explains about 38% of the total variation in amino acid substitution rates among different protein families. The propensity of a polypeptide to evolve fast or slowly may be predicted from an index or indices of protein mutability directly derivable from the amino acid composition. The propensity of an amino acid to remain conserved during evolutionary times depends not so much on its being featured prominently in active sites, but on its stability index, defined as the mean chemical distance [R. Grantham (1974) Science 185:862-864] between the amino acid and its mutational derivatives produced by single-nucleotide substitutions. Functional constraints related to active and binding sites of proteins play only a minor role in determining the overall rate of amino acid substitution. The importance of amino acid composition in determining rates of substitution is illustrated with examples involving cytochrome c, cytochrome b5, ras-related genes, the calmodulin protein family, and fibrinopeptides.
    Journal of Molecular Evolution 02/1985; 22(1):53-62. · 1.86 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: A computer program that progressively evaluates the hydrophilicity and hydrophobicity of a protein along its amino acid sequence has been devised. For this purpose, a hydropathy scale has been composed wherein the hydrophilic and hydrophobic properties of each of the 20 amino acid side-chains is taken into consideration. The scale is based on an amalgam of experimental observations derived from the literature. The program uses a moving-segment approach that continuously determines the average hydropathy within a segment of predetermined length as it advances through the sequence. The consecutive scores are plotted from the amino to the carboxy terminus. At the same time, a midpoint line is printed that corresponds to the grand average of the hydropathy of the amino acid compositions found in most of the sequenced proteins. In the case of soluble, globular proteins there is a remarkable correspondence between the interior portions of their sequence and the regions appearing on the hydrophobic side of the midpoint line, as well as the exterior portions and the regions on the hydrophilic side. The correlation was demonstrated by comparisons between the plotted values and known structures determined by crystallography. In the case of membrane-bound proteins, the portions of their sequences that are located within the lipid bilayer are also clearly delineated by large uninterrupted areas on the hydrophobic side of the midpoint line. As such, the membrane-spanning segments of these proteins can be identified by this procedure. Although the method is not unique and embodies principles that have long been appreciated, its simplicity and its graphic nature make it a very useful tool for the evaluation of protein structures.
    Journal of Molecular Biology 06/1982; 157(1):105-32. · 3.96 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Because of their wide use in molecular modeling, methods to compute molecular surfaces have received a lot of interest in recent years. However, most of the proposed algorithms compute the analytical representation of only the solvent-accessible surface. There are a few programs that compute the analytical representation of the solvent-excluded surface, but they often have problems handling singular cases of self-intersecting surfaces and tend to fail on large molecules (more than 10,000 atoms). We describe here a program called MSMS, which is shown to be fast and reliable in computing molecular surfaces. It relies on the use of the reduced surface that is briefly defined here and from which the solvent-accessible and solvent-excluded surfaces are computed. The four algorithms composing MSMS are described and their complexity is analyzed. Special attention is given to the handling of self-intersecting parts of the solvent-excluded surface called singularities. The program has been compared with Connolly's program PQMS [M. L. Connolly (1993) Journal of Molecular Graphics, Vol. 11, pp. 139–141] on a set of 709 molecules taken from the Brookhaven Data Base. MSMS was able to compute topologically correct surfaces for each molecule in the set. Moreover, the actual time spent to compute surfaces is in agreement with the theoretical complexity of the program, which is shown to be O[n log(n)] for n atoms. On a Hewlett-Packard 9000/735 workstation, MSMS takes 0.73 s to produce a triangulated solvent-excluded surface for crambin (1crn, 46 residues, 327 atoms, 4772 triangles), 4.6 s for thermolysin (3tln, 316 residues, 2437 atoms, 26462 triangles), and 104.53 s for glutamine synthetase (2gls, 5676 residues, 43632 atoms, 476665 triangles). © 1996 John Wiley & Sons, Inc.
    Biopolymers 02/1996; 38(3):305 - 320. · 2.29 Impact Factor


Available from