Article

Inference of historical changes in migration rate from the lengths of migrant tracts.

Department of Statistics, University of California, Berkeley, California 94720, USA.
Genetics (Impact Factor: 4.87). 01/2009; 181(2):711-9. DOI: 10.1534/genetics.108.098095
Source: PubMed

ABSTRACT After migrant chromosomes enter a population, they are progressively sliced into smaller pieces by recombination. Therefore, the length distribution of "migrant tracts" (chromosome segments with recent migrant ancestry) contains information about historical patterns of migration. Here we introduce a theoretical framework describing the migrant tract length distribution and propose a likelihood inference method to test demographic hypotheses and estimate parameters related to a historical change in migration rate. Applying this method to data from the hybridizing subspecies Mus musculus domesticus and M. m. musculus, we find evidence for an increase in the rate of hybridization. Our findings could indicate an evolutionary trajectory toward fusion rather than speciation in these taxa.

0 Followers
 · 
105 Views
  • [Show abstract] [Hide abstract]
    ABSTRACT: Admixture between long-separated populations is a defining feature of the genomes of many species. The mosaic block structure of admixed genomes can provide information about past contact events, including the time and extent of admixture. Here, we describe an improved wavelet-based technique that better characterizes ancestry block structure from observed genomic patterns. Principal Components Analysis is first applied to genomic data to identify the primary population structure, followed by wavelet decomposition to develop a new characterization of local ancestry information along the chromosomes. For testing purposes, this method is applied to human genome-wide genotype data from Indonesia, as well as virtual genetic data generated using genome-scale sequential coalescent simulations under a wide range of admixture scenarios. Time of admixture is inferred using an approximate Bayesian computation framework, providing robust estimates of both admixture times and their associated levels of uncertainty. Crucially, we demonstrate that this revised wavelet approach, which we have released as the R package adwave, provides improved statistical power over existing wavelet-based techniques and can be used to address a broad range of admixture questions. Copyright © 2015, The Genetics Society of America.
    Genetics 04/2015; DOI:10.1534/genetics.115.176842 · 4.87 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: At the crossroads between Africa and Eurasia, Arabia is necessarily a melting pot, its peoples enriched by successive gene flow over the generations. Estimating the timing and impact of these multiple migrations are important steps in reconstructing the key demographic events in the human history. However, current methods based on genome-wide information identify admixture events inefficiently, tending to estimate only the more recent ages, as here in the case of admixture events across the Red Sea (∼8-37 generations for African input into Arabia, and 30-90 generations for "back-to-Africa" migrations). An mtDNA-based founder analysis, corroborated by detailed analysis of the whole-mtDNA genome, affords an alternative means by which to identify, date and quantify multiple migration events at greater time depths, across the full range of modern human history, albeit for the maternal line of descent only. In Arabia, this approach enables us to infer several major pulses of dispersal between the Near East and Arabia, most likely via the Gulf corridor. Although some relict lineages survive in Arabia from the time of the out-of-Africa dispersal, 60 ka, the major episodes in the peopling of the Peninsula took place from north to south in the Late Glacial and, to a lesser extent, the immediate post-glacial/Neolithic. Exchanges across the Red Sea were mainly due to the Arab slave trade and maritime dominance (from ∼2.5 ka to very recent times), but had already begun by the early Holocene, fuelled by the establishment of maritime networks since ∼8 ka. The main "back-to-Africa" migrations, again undetected by genome-wide dating analyses, occurred in the Late Glacial period for introductions into eastern Africa, whilst the Neolithic was more significant for migrations towards North Africa.
    PLoS ONE 03/2015; 10(3):e0118625. DOI:10.1371/journal.pone.0118625 · 3.53 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Empirical phylogeographic studies have progressively sampled greater numbers of loci over time, in part motivated by theoretical papers showing that estimates of key demographic parameters improve as the number of loci increases. Recently, next-generation sequencing has been applied to questions about organismal history, with the promise of revolutionizing the field. However, no systematic assessment of how phylogeographic datasets have changed over time with respect to overall size and information content has been performed. Here, we quantify the changing nature of these genetic datasets over the past 20 years, focusing on papers published in Molecular Ecology. We found that the number of independent loci, the total number of alleles sampled, and the total number of single nucleotide polymorphisms (SNPs) per dataset has improved over time, with particularly dramatic increases within the past five years. Interestingly, uniparentally-inherited organellar markers (e.g., animal mitochondrial and plant chloroplast DNA) continue to represent an important component of phylogeographic data. Single-species studies (cf. comparative studies) that focus on vertebrates (particularly fish and to some extent, birds) represent the gold standard of phylogeographic data collection. Based on the current trajectory seen in our survey data, forecast modelling indicated that the median number of SNPs per dataset for studies published by the end of the year 2016 may approach ~20,000. This survey provides baseline information for understanding the evolution of phylogeographic datasets, and underscores the fact that development of analytical methods for handling very large genetic datasets will be critical for facilitating growth of the field.This article is protected by copyright. All rights reserved.
    Molecular Ecology 02/2015; DOI:10.1111/mec.13108 · 5.84 Impact Factor

Preview

Download
0 Downloads
Available from