Sequence divergence, functional constraint, and selection in protein evolution.

Department of Genome Sciences, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA.
Annual Review of Genomics and Human Genetics (Impact Factor: 9.13). 02/2003; 4:213-35. DOI: 10.1146/annurev.genom.4.020303.162528
Source: PubMed

ABSTRACT The genome sequences of multiple species has enabled functional inferences from comparative genomics. A primary objective is to infer biological functions from the conservation of homologous DNA sequences between species. A second, more difficult, objective is to understand what functional DNA sequences have changed over time and are responsible for species' phenotypic differences. The neutral theory of molecular evolution provides a theoretical framework in which both objectives can be explicitly tested. Development of statistical tests within this framework has provided insight into the evolutionary forces that constrain and in some cases change DNA sequences and the resulting patterns that emerge. In this article, we review recent work on how functional constraint and changes in protein function are inferred from protein polymorphism and divergence data. We relate these studies to our understanding of the neutral theory and adaptive evolution.

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Evolution is driven by mutations, which lead to new protein functions but come at a cost to protein stability. Non-conservative substitutions are of interest in this regard because they may most profoundly affect both function and stability. Accordingly, organisms must balance the benefit of accepting advantageous substitutions with the possible cost of deleterious effects on protein folding and stability. We here examine factors that systematically promote non-conservative mutations at the proteome level. Intrinsically disordered regions in proteins play pivotal roles in protein interactions, but many questions regarding their evolution remain unanswered. Similarly, whether and how molecular chaperones, which have been shown to buffer destabilizing mutations in individual proteins, generally provide robustness during proteome evolution remains unclear. To this end, we introduce an evolutionary parameter λ that directly estimates the rate of non-conservative substitutions. Our analysis of λ in Escherichia coli, Saccharomyces cerevisiae, and Homo sapiens sequences reveals how co- and post-translationally acting chaperones differentially promote non-conservative substitutions in their substrates, likely through buffering of their destabilizing effects. We further find that λ serves well to quantify the evolution of intrinsically disordered proteins even though the unstructured, thus generally variable regions in proteins are often flanked by very conserved sequences. Crucially, we show that both intrinsically disordered proteins and highly re-wired proteins in protein interaction networks, which have evolved new interactions and functions, exhibit a higher λ at the expense of enhanced chaperone assistance. Our findings thus highlight an intricate interplay of molecular chaperones and protein disorder in the evolvability of protein networks. Our results illuminate the role of chaperones in enabling protein evolution, and underline the importance of the cellular context and integrated approaches for understanding proteome evolution. We feel that the development of λ may be a valuable addition to the toolbox applied to understand the molecular basis of evolution.
    PLoS Computational Biology 06/2014; 10(6):e1003674. · 4.83 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: In the present study, we determined the mitochondrial DNA (mtDNA) sequence of three Neritas, Nerita versicolor, Nerita tessellata, and Nerita fulgurans. We present an analysis of the features of their gene content and genome organization and compare these within the genus Nerita, and among the main gastropod groups. The new sequences were used in a phylogenetic analysis including all available gastropod mitochondrial genomes. Genomic lengths were quite conserved, being 15,866 bp for N. versicolor, 15,741 bp for N. tessellata and 15,343 bp for N. fulgurans. Intergenic regions were generally short; genes are transcribed from both strands and have a nucleotide composition high in A and T. The high similarity in nucleotide content of the different sequences, gene composition, as well as an identical genomic organization among the Nerita species compared in this study, indicates a high degree of conservation within this diverse genus. Values of Ka/Ks of the 13 protein coding genes (PCGs) of Nerita species ranged from 0 to 0.18, and suggested different selection pressures in gene sequences. Bayesian phylogenetic analyses using concatenated DNA sequences of the 13 PCGs and the two rRNAs, and of amino acid sequences strongly supported Neritimorpha and Vetigastropoda as sister groups.
    Marine Genomics 06/2014; · 1.97 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The Bifidobacterium genus currently encompasses 48 recognized taxa, which have been isolated from different ecosystems. However, the current phylogeny of bifidobacteria is hampered by the relative paucity of genotypic data. Here, we re-assessed the taxonomy of this bacterial genus using genome-based approaches, which highlighted that of the previous taxonomic view of bifidobacteria contained several inconsistencies. In particular, high levels of genetic relatedness were shown to exist between particular Bifidobacterium taxa, which would not justify their status as separate species. The results presented are here based on Average Nucleotide Identity analysis involving the genome sequences for each type strain of the 48 bifidobacterial taxa, as well as phylogenetic comparative analysis of the predicted core genome of the Bifidobacterium genus. This study highlights that the availability of complete genome sequences allows the reconstruction of a more robust bifidobacterial phylogeny compared to that obtained from a single gene-based sequence comparison, thus discouraging the assignment of a new or separate bifidobacterial taxon without such a genome-based validation.
    Applied and Environmental Microbiology 08/2014; · 3.95 Impact Factor

Full-text (2 Sources)

Available from
May 19, 2014