Identifying recent adaptations in large-scale genomic data.

Department of Biology, MIT, Cambridge, MA 02139, USA
Cell (Impact Factor: 33.12). 02/2013; 152(4):703-13. DOI: 10.1016/j.cell.2013.01.035
Source: PubMed

ABSTRACT Although several hundred regions of the human genome harbor signals of positive natural selection, few of the relevant adaptive traits and variants have been elucidated. Using full-genome sequence variation from the 1000 Genomes (1000G) Project and the composite of multiple signals (CMS) test, we investigated 412 candidate signals and leveraged functional annotation, protein structure modeling, epigenetics, and association studies to identify and extensively annotate candidate causal variants. The resulting catalog provides a tractable list for experimental follow-up; it includes 35 high-scoring nonsynonymous variants, 59 variants associated with expression levels of a nearby coding gene or lincRNA, and numerous variants associated with susceptibility to infectious disease and other phenotypes. We experimentally characterized one candidate nonsynonymous variant in Toll-like receptor 5 (TLR5) and show that it leads to altered NF-κB signaling in response to bacterial flagellin. PAPERFLICK:

1 Follower
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Genome-wide scans for selection have identified multiple regions of the human genome as being targeted by positive selection. However, only a small proportion has been replicated across studies, and the prevalence of positive selection as a mechanism of adaptive change in humans remains controversial. Here we explore the power of two haplotype-based statistics - the integrated haplotype score (iHS) and the Derived Intra-allelic Nucleotide Diversity (DIND) test - in the context of next-generation sequencing data, and evaluate their robustness to demography and other selection modes. We show that these statistics are both powerful for the detection of recent positive selection, regardless of population history, and robust to variation in coverage, with DIND being insensitive to very low coverage. We apply these statistics to whole-genome sequence datasets from the 1000 Genomes Project and Complete Genomics. We found that putative targets of selection were highly significantly enriched in genic and non-synonymous SNPs, and that DIND was more powerful than iHS in the context of small sample sizes, low-quality genotype calling or poor coverage. As we excluded genomic confounders and alternative selection models, such as background selection, the observed enrichment attests to the action of recent, strong positive selection. Further support to the adaptive significance of these genomic regions came from their enrichment in functional variants detected by genome-wide association studies, informing the relationship between past selection and current benign and disease-related phenotypic variation. Our results indicate that hard sweeps targeting low-frequency standing variation have played a moderate, albeit significant, role in recent human evolution.
    Molecular Biology and Evolution 04/2014; DOI:10.1093/molbev/msu118 · 14.31 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The reasons for the ethnic disparities in the prevalence of systemic lupus erythematosus (SLE) and the relative high frequency of SLE risk alleles in the population are not fully understood. Population genetic factors such as natural selection alter allele frequencies over generations and may help explain the persistence of such common risk variants in the population and the differential risk of SLE. In order to better understand the genetic basis of SLE that might be due to natural selection, a total of 74 genomic regions with compelling evidence for association with SLE were tested for evidence of recent positive selection in the HapMap and HGDP populations, using population differentiation, allele frequency, and haplotype-based tests. Consistent signs of positive selection across different studies and statistical methods were observed at several SLE-associated loci, including PTPN22, TNFSF4, TET3-DGUOK, TNIP1, UHRF1BP1, BLK, and ITGAM genes. This study is the first to evaluate and report that several SLE-associated regions show signs of positive natural selection. These results provide corroborating evidence in support of recent positive selection as one mechanism underlying the elevated population frequency of SLE risk loci and supports future research that integrates signals of natural selection to help identify functional SLE risk alleles.
    01/2014; 2014:203435. DOI:10.1155/2014/203435
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: It has recently been hypothesized that polygenic adaptation, resulting in modest allele frequency changes at many loci, could be a major mechanism behind the adaptation of complex phenotypes in human populations. Here we leverage the large number of variants that have been identified through genome-wide association (GWA) studies to comprehensively study signatures of natural selection on genetic variants associated with complex traits. Using population differentiation based methods, such as FST and phylogenetic branch length analyses, we systematically examined nearly 1300 SNPs associated with 38 complex phenotypes. Instead of detecting selection signatures at individual variants, we aimed to identify combined evidence of natural selection by aggregating signals across many trait associated SNPs. Our results have revealed some general features of polygenic selection on complex traits associated variants. First, natural selection acting on standing variants associated with complex traits is a common phenomenon. Second, characteristics of selection for different polygenic traits vary both temporarily and geographically. Third, some studied traits (e.g. height and urate level) could have been the primary targets of selection, as indicated by the significant correlation between the effect sizes and the estimated strength of selection in the trait associated variants; however, for most traits, the allele frequency changes in trait associated variants might have been driven by the selection on other correlated phenotypes. Fourth, the changes in allele frequencies as a result of selection can be highly stochastic, such that, polygenic adaptation may accelerate differentiation in allele frequencies among populations, but generally does not produce predictable directional changes. Fifth, multiple mechanisms (pleiotropy, hitchhiking, etc) may act together to govern the changes in allele frequencies of genetic variants associated with complex traits.
    12/2013; 2. DOI:10.1016/j.atg.2013.10.002