[Show abstract][Hide abstract]ABSTRACT: One megabase windows displaying extended regions of statistical significance calculated using the hypergeometric distribution for LSBL, lnRH, and the standardized difference of D. All windows listed are statistically significant for the standardized difference of D and LSBL or lnRH as indicated in column 2.
(0.12 MB DOC)
[Show abstract][Hide abstract]ABSTRACT: Genes encoded by the four consecutive one megabase windows on chromosome 12 spanning 109,000,000 bp to 113,000,000 bp that were significant for the hypergeometric distribution.
(0.07 MB DOC)
[Show abstract][Hide abstract]ABSTRACT: Indigenous American ancestry estimates. (A) The effect of admixture on PC plot with all 439,046 SNPs (B) Intermediate PC plot with 240,969 SNPs (threshold 0.9) showing how the gap with the European cluster is reduced. Final step, with threshold 0.8 is shown in Figure 2A (where Europeans (EUR) have been removed to simplify the graphical representation).
(0.27 MB TIF)
[Show abstract][Hide abstract]ABSTRACT: High-altitude hypoxia (reduced inspired oxygen tension due to decreased barometric pressure) exerts severe physiological stress on the human body. Two high-altitude regions where humans have lived for millennia are the Andean Altiplano and the Tibetan Plateau. Populations living in these regions exhibit unique circulatory, respiratory, and hematological adaptations to life at high altitude. Although these responses have been well characterized physiologically, their underlying genetic basis remains unknown. We performed a genome scan to identify genes showing evidence of adaptation to hypoxia. We looked across each chromosome to identify genomic regions with previously unknown function with respect to altitude phenotypes. In addition, groups of genes functioning in oxygen metabolism and sensing were examined to test the hypothesis that particular pathways have been involved in genetic adaptation to altitude. Applying four population genetic statistics commonly used for detecting signatures of natural selection, we identified selection-nominated candidate genes and gene regions in these two populations (Andeans and Tibetans) separately. The Tibetan and Andean patterns of genetic adaptation are largely distinct from one another, with both populations showing evidence of positive natural selection in different genes or gene regions. Interestingly, one gene previously known to be important in cellular oxygen sensing, EGLN1 (also known as PHD2), shows evidence of positive selection in both Tibetans and Andeans. However, the pattern of variation for this gene differs between the two populations. Our results indicate that several key HIF-regulatory and targeted genes are responsible for adaptation to high altitude in Andeans and Tibetans, and several different chromosomal regions are implicated in the putative response to selection. These data suggest a genetic role in high-altitude adaption and provide a basis for future genotype/phenotype association studies necessary to confirm the role of selection-nominated candidate genes and gene regions in adaptation to altitude.
[Show abstract][Hide abstract]ABSTRACT: High-altitude environments (>2,500 m) provide scientists with a natural laboratory to study the physiological and genetic effects of low ambient oxygen tension on human populations. One approach to understanding how life at high altitude has affected human metabolism is to survey genome-wide datasets for signatures of natural selection. In this work, we report on a study to identify selection-nominated candidate genes involved in adaptation to hypoxia in one highland group, Andeans from the South American Altiplano. We analysed dense microarray genotype data using four test statistics that detect departures from neutrality. Using a candidate gene, single nucleotide polymorphism-based approach, we identified genes exhibiting preliminary evidence of recent genetic adaptation in this population. These included genes that are part of the hypoxia-inducible transcription factor ( HIF ) pathway, a biochemical pathway involved in oxygen homeostasis, as well as three other genomic regions previously not known to be associated with high-altitude phenotypes. In addition to identifying selection-nominated candidate genes, we also tested whether the HIF pathway shows evidence of natural selection. Our results indicate that the genes of this biochemical pathway as a group show no evidence of having evolved in response to hypoxia in Andeans. Results from particular HIF -targeted genes, however, suggest that genes in this pathway could play a role in Andean adaptation to high altitude, even if the pathway as a whole does not show higher relative rates of evolution. These data suggest a genetic role in high-altitude adaptation and provide a basis for genotype/phenotype association studies that are necessary to confirm the role of putative natural selection candidate genes and gene regions in adaptation to altitude.
[Show abstract][Hide abstract]ABSTRACT: Dissecting the genetic basis of disease risk requires measuring all forms of genetic variation, including SNPs and copy number variants (CNVs), and is enabled by accurate maps of their locations, frequencies and population-genetic properties. We designed a hybrid genotyping array (Affymetrix SNP 6.0) to simultaneously measure 906,600 SNPs and copy number at 1.8 million genomic locations. By characterizing 270 HapMap samples, we developed a map of human CNV (at 2-kb breakpoint resolution) informed by integer genotypes for 1,320 copy number polymorphisms (CNPs) that segregate at an allele frequency >1%. More than 80% of the sequence in previously reported CNV regions fell outside our estimated CNV boundaries, indicating that large (>100 kb) CNVs affect much less of the genome than initially reported. Approximately 80% of observed copy number differences between pairs of individuals were due to common CNPs with an allele frequency >5%, and more than 99% derived from inheritance rather than new mutation. Most common, diallelic CNPs were in strong linkage disequilibrium with SNPs, and most low-frequency CNVs segregated on specific SNP haplotypes.
[Show abstract][Hide abstract]ABSTRACT: While the number of success stories for mapping genes associated with complex diseases using genome-wide association approaches is growing, there is still much work to be done in developing methods for such studies when the samples are collected from a population, which may not be homogeneous. Here we report the first genome-wide association study to identify genes associated with asthma in an admixed population. We genotyped 96 Puerto Rican moderate to severe asthma cases and 88 controls as well as 109 samples representing Puerto Rico's founding populations using the Affymetrix GeneChip Human Mapping 100K array sets. The data from samples representing Puerto Rico's founding populations was used to identify ancestry informative markers for admixture mapping analyses. In addition, a genome-wide association analysis using logistic regression was performed on the data. Although neither admixture mapping nor regression analysis gave any significant association with asthma after correction for multiple testing, an overlap analysis using the top scoring SNPs from different methods suggested chromosomal regions 5q23.3 and 13q13.3 as potential regions harboring genes for asthma in Puerto Ricans. The validation analysis of these two regions in 284 Puerto Rican asthma trios gave significant association for the 5q23.3 region. Our results provide strong evidence that the previously linked 5q23 region is associated with asthma in Puerto Ricans. The detection of causative variants in this region will require fine mapping and functional validation.
[Show abstract][Hide abstract]ABSTRACT: Recent studies have used dense markers to examine the human genome in ancestrally homogeneous populations for hallmarks of selection. No genomewide studies have focused on recently admixed groups--populations that have experienced admixing among continentally divided ancestral populations within the past 200-500 years. New World admixed populations are unique in that they represent the sudden confluence of geographically diverged genomes with novel environmental challenges. Here, we present a novel approach for studying selection by examining the genomewide distribution of ancestry in the genetically admixed Puerto Ricans. We find strong statistical evidence of recent selection in three chromosomal regions, including the human leukocyte antigen region on chromosome 6p, chromosome 8q, and chromosome 11q. Two of these regions harbor genes for olfactory receptors. Interestingly, all three regions exhibit deficiencies in the European-ancestry proportion.
Full-text · Article · Oct 2007 · The American Journal of Human Genetics
[Show abstract][Hide abstract]ABSTRACT: Admixture mapping (AM) is a promising method for the identification of genetic risk factors for complex traits and diseases showing prevalence differences among populations. Efficient application of this method requires the use of a genomewide panel of ancestry-informative markers (AIMs) to infer the population of origin of chromosomal regions in admixed individuals. Genomewide AM panels with markers showing high frequency differences between West African and European populations are already available for disease-gene discovery in African Americans. However, no such a map is yet available for Hispanic/Latino populations, which are the result of two-way admixture between Native American and European populations or of three-way admixture of Native American, European, and West African populations. Here, we report a genomewide AM panel with 2,120 AIMs showing high frequency differences between Native American and European populations. The average intermarker genetic distance is ~1.7 cM. The panel was identified by genotyping, with the Affymetrix GeneChip Human Mapping 500K array, a population sample with European ancestry, a Mesoamerican sample comprising Maya and Nahua from Mexico, and a South American sample comprising Aymara/Quechua from Bolivia and Quechua from Peru. The main criteria for marker selection were both high information content for Native American/European ancestry (measured as the standardized variance of the allele frequencies, also known as "f value") and small frequency differences between the Mesoamerican and South American samples. This genomewide AM panel will make it possible to apply AM approaches in many admixed populations throughout the Americas.
Full-text · Article · Jul 2007 · The American Journal of Human Genetics