Review and Evaluation of Methods Correcting for Population Stratification with a Focus on Underlying Statistical Principles

Department of Biostatistics, Section on Statistical Genetics, University of Alabama at Birmingham, Birmingham, AL 35294, USA.
Human Heredity (Impact Factor: 1.64). 02/2008; 66(2):67-86. DOI: 10.1159/000119107
Source: PubMed

ABSTRACT When two or more populations have been separated by geographic or cultural boundaries for many generations, drift, spontaneous mutations, differential selection pressures and other factors may lead to allele frequency differences among populations. If these 'parental' populations subsequently come together and begin intermating, disequilibrium among linked markers may span a greater genetic distance than it typically does among populations under panmixia [see glossary]. This extended disequilibrium can make association studies highly effective and more economical than disequilibrium mapping in panmictic populations, since fewer marker loci are needed to detect regions of the genome that harbor phenotype-influencing loci. However, under some circumstances, this process of intermating (as well as other processes) can produce disequilibrium between pairs of unlinked loci and thus create the possibility of confounding or spurious associations due to this population stratification. Accordingly, researchers are advised to employ valid statistical tests for linkage disequilibrium mapping that allow genetic association studies to be conducted while controlling for such confounding. Many recent papers have addressed this need. We provide a comprehensive review of advances made in recent years in correcting for population stratification and then evaluate and synthesize these methods based on statistical principles such as (1) randomization, (2) conditioning on sufficient statistics, and (3) identifying whether the method is based on testing the genotype-phenotype covariance (conditional upon familial information) and/or testing departures of the marginal distribution from the expected genotypic frequencies.
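The confounding described in the abstract can be illustrated with a small numerical sketch. It is not a method from the paper: the two populations, their allele-carrier frequencies, disease prevalences, sample sizes, and all function names below are hypothetical values chosen for illustration. Within each population the allele and the disease are constructed to be independent, yet pooling the populations yields an apparent association. A Mantel-Haenszel odds ratio, used here only as one familiar way of conditioning on population membership, removes it.

```python
# Illustrative sketch with made-up numbers: a spurious allele-disease
# association that arises purely from pooling two populations.

def expected_table(n, carrier_freq, prevalence):
    """Expected 2x2 counts (a, b, c, d) when carrier status and disease
    are independent within one population.
    a: carrier & case, b: carrier & control,
    c: non-carrier & case, d: non-carrier & control."""
    a = n * carrier_freq * prevalence
    b = n * carrier_freq * (1 - prevalence)
    c = n * (1 - carrier_freq) * prevalence
    d = n * (1 - carrier_freq) * (1 - prevalence)
    return (a, b, c, d)

def odds_ratio(table):
    """Crude odds ratio of a single 2x2 table."""
    a, b, c, d = table
    return (a * d) / (b * c)

def pool(tables):
    """Cell-wise sum of several 2x2 tables, ignoring population labels."""
    return tuple(sum(cells) for cells in zip(*tables))

def mantel_haenszel_or(tables):
    """Mantel-Haenszel odds ratio, stratified by population."""
    num = sum(a * d / (a + b + c + d) for a, b, c, d in tables)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in tables)
    return num / den

# Hypothetical populations: the allele is common where the disease is common.
pop_a = expected_table(n=1000, carrier_freq=0.8, prevalence=0.3)
pop_b = expected_table(n=1000, carrier_freq=0.2, prevalence=0.1)

pooled_or = odds_ratio(pool([pop_a, pop_b]))
stratified_or = mantel_haenszel_or([pop_a, pop_b])

print("OR within population A:", round(odds_ratio(pop_a), 3))
print("OR within population B:", round(odds_ratio(pop_b), 3))
print("OR in pooled sample:   ", round(pooled_or, 3))
print("Mantel-Haenszel OR:    ", round(stratified_or, 3))
```

With these assumed inputs the within-population odds ratios equal 1 by construction, while the pooled odds ratio is well above 1 (about 2.16), even though the allele has no effect on disease in either population.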

  • ABSTRACT: Maternal genetic and phenotypic characteristics (e.g., metabolic and behavioral) affect both the intrauterine milieu and lifelong health trajectories of their fetuses. Yet at the same time, fetal genotype may affect processes that alter pre and postnatal maternal physiology, and the subsequent health of both fetus and mother. We refer to these latter effects as 'fetal drive.' If fetal genotype is driving physiologic, metabolic, and behavioral phenotypic changes in the mother, there is a possibility of differential effects with different fetal genomes inducing different long-term effects on both maternal and fetal health, mediated through intrauterine environment. This proposed mechanistic path remains largely unexamined and untested. In this study, we offer a statistical method to rigorously test this hypothesis and make causal inferences in humans by relying on the (conditional) randomization inherent in the process of meiosis. For illustration, we apply this method to a dataset from the Framingham Heart Study.
    Frontiers in Genetics 01/2014; 5:464. DOI:10.3389/fgene.2014.00464
  • ABSTRACT: This paper presents an overview of historical advances and the current state of genetic psychophysiology, a rapidly developing interdisciplinary research linking genetics, brain, and human behavior, discusses methodological problems, and outlines future directions of research. The main goals of genetic psychophysiology are to elucidate the neural pathways and mechanisms mediating genetic influences on cognition and emotion, identify intermediate brain-based phenotypes for psychopathology, and provide a functional characterization of genes being discovered by large association studies of behavioral phenotypes. Since the initiation of this neurogenetic approach to human individual differences in the 1970s, numerous twin and family studies have provided strong evidence for heritability of diverse aspects of brain function including resting-state brain oscillations, functional connectivity, and event-related neural activity in a variety of cognitive and emotion processing tasks, as well as peripheral psychophysiological responses. These data indicate large differences in the presence and strength of genetic influences across measures and domains, permitting the selection of heritable characteristics for gene finding studies. More recently, candidate gene association studies began to implicate specific genetic variants in different aspects of neurocognition. However, great caution is needed in pursuing this line of research due to its demonstrated proneness to generate false-positive findings. Recent developments in methods for physiological signal analysis, hemodynamic imaging, and genomic technologies offer new exciting opportunities for the investigation of the interplay between genetic and environmental factors in the development of individual differences in behavior, both normal and abnormal.
    International journal of psychophysiology: official journal of the International Organization of Psychophysiology 04/2014; DOI:10.1016/j.ijpsycho.2014.04.003 · 2.65 Impact Factor
  • ABSTRACT: Ancestry-informative markers (AIMs) are powerful tools for inferring the genetic composition of admixed populations. In this study, we determined the genetic ancestry of the Ouro Preto (Brazil) population and evaluated the association between ancestry and self-reported skin color. The genetic ancestry of 189 children and adolescents was estimated by genotyping 15 AIMs. The estimate of population admixture was determined using the Bayesian Markov Chain Monte Carlo (MCMC) method implemented in two different programs (STRUCTURE and ADMIXMAP). Volunteers self-reported their skin colors. The European ancestry contribution ranged from 0.503 to 0.539, the African contribution ranged from 0.333 to 0.425, and the Amerindian component ranged from 0.04 to 0.164. The relative contributions of African (P < 0.016) and European (P < 0.011) ancestry differed significantly among skin color groups, except between black and dark-brown groups. The population of Ouro Preto has a higher contribution of African ancestry compared to the mean for the southeast region of Brazil. Therefore, extrapolating the African ancestry contribution for southeastern Brazil to the Ouro Preto population would underestimate the actual value for this city. We also showed that self-reported skin color could be appropriate for describing the genetic structure of this particular population.
    Genetics and molecular research: GMR 01/2013; 12(4):5124-33. DOI:10.4238/2013.October.29.6 · 0.85 Impact Factor

