Diversity of Lactase Persistence Alleles in Ethiopia: Signature of a Soft Selective Sweep.

Research Department of Genetics Evolution and Environment, University College London, Darwin Building, London WC1E 6BT, UK.
The American Journal of Human Genetics (Impact Factor: 10.99). 08/2013; DOI: 10.1016/j.ajhg.2013.07.008
Source: PubMed

ABSTRACT The persistent expression of lactase into adulthood in humans is a recent genetic adaptation that allows the consumption of milk from other mammals after weaning. In Europe, a single allele (-13910(∗)T, rs4988235) in an upstream region that acts as an enhancer to the expression of the lactase gene LCT is responsible for lactase persistence and appears to have been under strong directional selection in the last 5,000 years, evidenced by the widespread occurrence of this allele on an extended haplotype. In Africa and the Middle East, the situation is more complicated and at least three other alleles (-13907(∗)G, rs41525747; -13915(∗)G, rs41380347; -14010(∗)C, rs145946881) in the same LCT enhancer region can cause continued lactase expression. Here we examine the LCT enhancer sequence in a large lactose-tolerance-tested Ethiopian cohort of more than 350 individuals. We show that a further SNP, -14009T>G (ss 820486563), is significantly associated with lactose-digester status, and in vitro functional tests confirm that the -14009(∗)G allele also increases expression of an LCT promoter construct. The derived alleles in the LCT enhancer region are spread through several ethnic groups, and we report a greater genetic diversity in lactose digesters than in nondigesters. By examining flanking markers to control for the effects of mutation and demography, we further describe, from empirical evidence, the signature of a soft selective sweep.

  • [Show abstract] [Hide abstract]
    ABSTRACT: Lactase persistence (LP), the ability to digest lactose into adulthood, is strongly associated with the cultural traits of pastoralism and milk-drinking among human populations, and several different genetic variants are known that confer LP. Recent studies of LP variants in Southern African populations, with a focus on Khoisan-speaking groups, found high frequencies of an LP variant (the C-14010 allele) that also occurs in Eastern Africa, and concluded that the C-14010 allele was brought to Southern Africa via a migration of pastoralists from Eastern Africa. However, this conclusion was based on indirect evidence; to date no study has jointly analyzed data on the C-14010 allele from both Southern African Khoisan-speaking groups and Eastern Africa. Here, we combine and analyze published data on the C-14010 allele in Southern and Eastern African populations, consisting of haplotypes with the C-14010 allele and four closely-linked short tandem repeat loci. Our results provide direct evidence for the previously-hypothesized Eastern African origin of the C-14010 allele in Southern African Khoisan-speaking groups. In addition, we find evidence for a separate introduction of the C-14010 allele into the Bantu-speaking Xhosa. The estimated selection intensity on the C-14010 allele in Eastern Africa is lower than that in Southern Africa, which suggests that in Eastern Africa the dietary changes conferring the fitness advantage associated with LP occurred some time after the origin of the C-14010 allele. Conversely, in Southern Africa the fitness advantage was present when the allele was introduced, as would be expected if pastoralism was introduced concomitantly. Am J Phys Anthropol, 2014. © 2014 Wiley Periodicals, Inc.
    American Journal of Physical Anthropology 12/2014; 156(4). DOI:10.1002/ajpa.22675 · 2.51 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: This review explores the limitations of self-reported race, ethnicity, and genetic ancestry in biomedical research. Various terminologies are used to classify human differences in genomic research including race, ethnicity, and ancestry. Although race and ethnicity are related, race refers to a person¿s physical appearance, such as skin color and eye color. Ethnicity, on the other hand, refers to communality in cultural heritage, language, social practice, traditions, and geopolitical factors. Genetic ancestry inferred using ancestry informative markers (AIMs) is based on genetic/genomic data. Phenotype-based race/ethnicity information and data computed using AIMs often disagree. For example, self-reporting African Americans can have drastically different levels of African or European ancestry. Genetic analysis of individual ancestry shows that some self-identified African Americans have up to 99% of European ancestry, whereas some self-identified European Americans have substantial admixture from African ancestry. Similarly, African ancestry in the Latino population varies between 3% in Mexican Americans to 16% in Puerto Ricans. The implication of this is that, in African American or Latino populations, self-reported ancestry may not be as accurate as direct assessment of individual genomic information in predicting treatment outcomes. To better understand human genetic variation in the context of health disparities, we suggest using ¿ancestry¿ (or biogeographical ancestry) to describe actual genetic variation, ¿race¿ to describe health disparity in societies characterized by racial categories, and ¿ethnicity¿ to describe traditions, lifestyle, diet, and values. We also suggest using ancestry informative markers for precise characterization of individuals¿ biological ancestry. Understanding the sources of human genetic variation and the causes of health disparities could lead to interventions that would improve the health of all individuals.
    Human genomics 01/2015; 9(1):1. DOI:10.1186/PREACCEPT-2695828013752627
  • [Show abstract] [Hide abstract]
    ABSTRACT: We attempted to confirm the resemblance of a local medieval population and to reconstruct their contribution to the formation of the modern Polish population at the DNA level. The HVR I mtDNA sequence and two nuclear alleles, LCT-13910C/T SNP and deltaF508 CFTR, were chosen as markers since the distribution of selected nuclear alleles varies among ethnic groups. A total of 47 specimens were selected from a medieval cemetery in Cedynia (located in the western Polish lowland). Regarding the HVR I profile, the analyzed population differed from the present-day population (P=0.045, Fst=0.0103), in contrast to lactase persistence (LP) based on the LCT-13910T allele, thus indicating the lack of notable frequency changes of this allele during the last millennium (P=0.141). The sequence of the HVR I mtDNA fragment allowed to identify six major haplogroups including H, U5, T, K, and HV0 within the medieval population of Cedynia which are common in today's central Europe. An analysis of haplogroup frequency and its comparison with modern European populations shows that the studied medieval population is more closely related to Finno-Ugric populations than to the present Polish population. Identification of less common haplogroups, i.e., Z and U2, both atypical of the modern Polish population and of Asian origin, provides evidence for some kind of connections between the studied and foreign populations. Furthermore, a comparison of the available aDNA sequences from medieval Europe suggests that populations differed from one another and a number of data from other locations are required to find out more about the features of the medieval gene pool profile. Copyright © 2015 Elsevier GmbH. All rights reserved.
    HOMO - Journal of Comparative Human Biology 03/2015; DOI:10.1016/j.jchb.2014.11.003 · 0.73 Impact Factor