Fay JC, Wu CI. Sequence divergence, functional constraint, and selection in protein evolution. Annu Rev Genomics Hum Genet 4: 213-235

Lawrence Berkeley Laboratory, University of California, Berkeley, Berkeley, California, United States
Annual Review of Genomics and Human Genetics (Impact Factor: 8.96). 02/2003; 4:213-35. DOI: 10.1146/annurev.genom.4.020303.162528
Source: PubMed


The genome sequences of multiple species has enabled functional inferences from comparative genomics. A primary objective is to infer biological functions from the conservation of homologous DNA sequences between species. A second, more difficult, objective is to understand what functional DNA sequences have changed over time and are responsible for species' phenotypic differences. The neutral theory of molecular evolution provides a theoretical framework in which both objectives can be explicitly tested. Development of statistical tests within this framework has provided insight into the evolutionary forces that constrain and in some cases change DNA sequences and the resulting patterns that emerge. In this article, we review recent work on how functional constraint and changes in protein function are inferred from protein polymorphism and divergence data. We relate these studies to our understanding of the neutral theory and adaptive evolution.

Download full-text


Available from: Justin C Fay
    • "Therefore, a mutation is said to be adaptive if it performs a function that is in some way advantageous in the population. Negative selection favors the conservation of existing phenotypes or particular amino acid residues functionally constrained, playing an important role in maintaining the long-term stability of biological function of the proteins [65] "
    [Show abstract] [Hide abstract]
    ABSTRACT: Antimicrobial peptides and proteins (AMPs) are widespread in the living kingdom. They are key effectors of defense reactions and mediators of competitions between organisms. They are often cationic and amphiphilic, which favors their interactions with the anionic membranes of microorganisms. Several AMP families do not directly alter membrane integrity but rather target conserved components of the bacterial membranes in a process that provides them with potent and specific antimicrobial activities. Thus, lipopolysaccharides (LPS), lipoteichoic acids (LTA) or the peptidoglycan precursor Lipid II is targeted by a broad series of AMPs. Studying the functional diversity of immune effectors tells us about the essential residues involved in AMP mechanism of action. Marine invertebrates have been found to produce a remarkable diversity of AMPs. Molluscan defensins and crustacean anti-LPS factors (ALF) are diverse in terms of amino acid sequence and show contrasted phenotypes in terms of antimicrobial activity. Their activity is directed essentially against Gram-positive or Gram-negative bacteria due to their specific interactions with Lipid II or Lipid A, respectively. Through those interesting examples, we discuss here how sequence diversity generated throughout evolution informs us on residues required for essential molecular interaction at the bacterial membranes and subsequent antibacterial activity. Through the analysis of molecular variants having lost antibacterial activity or shaped novel functions, we also discuss the molecular bases of functional divergence in AMPs. This article is part of a Special Issue entitled: Antimicrobial peptides edited by Karl Lohner and Kai Hilpert.
    No preview · Article · Oct 2015 · Biochimica et Biophysica Acta
    • "Therefore, a mutation is said to be adaptive if it performs a function that is in some way advantageous in the population. Negative selection favors the conservation of existing phenotypes or particular amino acid residues functionally constrained, playing an important role in maintaining the long-term stability of biological function of the proteins [65] "
    [Show abstract] [Hide abstract]
    ABSTRACT: Recent studies revealed that several vibrio species have evolved the capacity to survive inside host cells. However, it is still often ignored if intracellular stages are required for pathogenicity. Virulence of Vibrio tasmaniensis LGP32, a strain pathogenic for Crassostrea gigas oysters, depends on entry into hemocytes, the oyster immune cells. We investigated here the mechanisms of LGP32 intracellular survival and their consequences on the host-pathogen interaction. Entry and survival inside hemocytes were required for LGP32-driven cytolysis of hemocytes, both in vivo and in vitro. LGP32 intracellular stages showed a profound boost in metabolic activity and a major transcription of antioxidant and copper detoxification genes, as revealed by RNA sequencing. LGP32 isogenic mutants showed that resistance to oxidative stress and copper efflux are two main functions required for vibrio intracellular stages and cytotoxicity to hemocytes. Copper efflux was also essential for host colonization and virulence in vivo. Altogether our results identify copper resistance as a major mechanism to resist killing by phagocytes, induce cytolysis of immune cells and colonize oysters. Selection of such resistance traits could arise from vibrio interactions with copper-rich environmental niches including marine invertebrates, which favor the emergence of pathogenic vibrios resistant to intraphagosomal killing across animal species.
    No preview · Article · Oct 2015 · Environmental Microbiology
    • "Similarly, for Leu (UUR), the RSCU was 2.63 for UUA and 1.62 for UUG. The estimation of nonsynonymous (Ka) and synonymous (Ks) substitution rates is quite useful for understanding the selective constraints acting on the protein-coding sequences across closely related species (Ohta, 1995; Fay and Wu, 2003). In order to detect the influence of selection pressure in Arcidae species, the numbers of Ka, Ks and their ratios were calculated for all pairwise comparisons among the four Arcidae (Supplementary Table 2). "
    [Show abstract] [Hide abstract]
    ABSTRACT: The mitochondrial (mt) genome is a significant tool for investigating the evolutionary history of metazoan animals. The family Arcidae belongs to the superfamily Arcacea in the bivalve order Arcoida, comprising about 260 species. Currently, three complete mitochondrial genomes are available in GenBank, representing 1 subfamily and 2 genera. Here we present the complete mitochondrial genome sequence of Anadara vellicata (Bivalvia: Arcidae), the first report of complete mitogenome from Anadara, Arcidae, and compared its sequence with other available Arcidae mitogenomes. The A. vellicata mitogenome is 34,147bp in length, including 12 protein-coding genes (PCGs), 25 transfer RNAs (tRNAs), 2 ribosomal RNA (rRNA) genes and non-coding regions (NCR) (20,722bp). The nucleotide composition of the genome is A+T biased, accounting for 61.03%, with negative AT skew (-0.12) and positive GC skew (0.41). We report the evidence of alloacceptor tRNA gene recruitment (trnY-trnL2). A conserved 23bp-long sequence was used as the basis to infer the 3' terminus of rrnS. Most of the non-coding sequences (16,112bp) are observed within one segment. In the NCR, the tandem repeat (TR) region is 1143bp, comprising six tandem repeats with 189bp to 192bp in length. In addition, a long thymine-nucleotide stretch (T-stretch) was detected in the NCR of A. vellicata. The gene order and transcriptional polarity of the protein-coding genes is identical to other Arcidae species. tRNA genes are rearranged, making the gene order unique. The results support that mt gene arrangement among Arcidae species is not random, but correlated with their evolutionary relationships. Copyright © 2015 Elsevier Inc. All rights reserved.
    No preview · Article · Aug 2015 · Comparative Biochemistry and Physiology Part D Genomics and Proteomics
Show more