Publications (14)216.36 Total impact
-
Dataset: PLoS Genet 2005 Yu
-
Article: A second generation human haplotype map of over 3.1 million SNPs.
[show abstract] [hide abstract]
ABSTRACT: We describe the Phase II HapMap, which characterizes over 3.1 million human single nucleotide polymorphisms (SNPs) genotyped in 270 individuals from four geographically diverse populations and includes 25-35% of common SNP variation in the populations surveyed. The map is estimated to capture untyped common variation with an average maximum r2 of between 0.9 and 0.96 depending on population. We demonstrate that the current generation of commercial genome-wide genotyping products captures common Phase II SNPs with an average maximum r2 of up to 0.8 in African and up to 0.95 in non-African populations, and that potential gains in power in association studies can be obtained through imputation. These data also reveal novel aspects of the structure of linkage disequilibrium. We show that 10-30% of pairs of individuals within a population share at least one region of extended genetic identity arising from recent ancestry and that up to 1% of all common variants are untaggable, primarily because they lie within recombination hotspots. We show that recombination rates vary systematically around genes and between genes of different function. Finally, we demonstrate increased differentiation at non-synonymous, compared to synonymous, SNPs, resulting from systematic differences in the strength or efficacy of natural selection between populations.Nature 11/2007; 449(7164):851-61. · 36.28 Impact Factor -
Article: Genome-wide detection and characterization of positive selection in human populations.
[show abstract] [hide abstract]
ABSTRACT: With the advent of dense maps of human genetic variation, it is now possible to detect positive natural selection across the human genome. Here we report an analysis of over 3 million polymorphisms from the International HapMap Project Phase 2 (HapMap2). We used 'long-range haplotype' methods, which were developed to identify alleles segregating in a population that have undergone recent selection, and we also developed new methods that are based on cross-population comparisons to discover alleles that have swept to near-fixation within a population. The analysis reveals more than 300 strong candidate regions. Focusing on the strongest 22 regions, we develop a heuristic for scrutinizing these regions to identify candidate targets of selection. In a complementary analysis, we identify 26 non-synonymous, coding, single nucleotide polymorphisms showing regional evidence of positive selection. Examination of these candidates highlights three cases in which two genes in a common biological process have apparently undergone positive selection in the same population:LARGE and DMD, both related to infection by the Lassa virus, in West Africa;SLC24A5 and SLC45A2, both involved in skin pigmentation, in Europe; and EDAR and EDA2R, both involved in development of hair follicles, in Asia.Nature 11/2007; 449(7164):913-8. · 36.28 Impact Factor -
Article: Genome-wide detection and characterization of positive selection in human populations
[show abstract] [hide abstract]
ABSTRACT: With the advent of dense maps of human genetic variation, it is now possible to detect positive natural selection across the human genome. Here we report an analysis of over 3 million polymorphisms from the International HapMap Project Phase 2 (HapMap2)Nature 10/2007; 449(7164):913-918. · 36.28 Impact Factor -
Article: Multiplex assay for comprehensive genotyping of genes involved in drug metabolism, excretion, and transport.
[show abstract] [hide abstract]
ABSTRACT: Drug metabolism is a multistep process by which the body disposes of xenobiotic agents such as therapeutic drugs. Genetic variation in the enzymes involved in this process can lead to variability in a patient's response to medication. We used molecular-inversion probe technology to develop a multiplex genotyping assay that can simultaneously test for 1227 genetic variants in 169 genes involved in drug metabolism, excretion, and transport. Within this larger set of variants, we performed analytical validation of a clinically defined core set of 165 variants in 27 genes to assess accuracy, imprecision, and dynamic range. In a test set of 91 samples, genotyping accuracy for the core set probes was 99.8% for called genotypes, with a 1.2% no-call (NC) rate. The majority of the core set probes (133 of 165) had < or = 1 genotyping failure in the test set; a subset of 12 probes was responsible for the majority of failures (mainly NC). Genotyping results were reproducible upon repeat testing with overall within- and between-run variation of 1.1% and 1.4%, respectively-again, primarily NCs in a subset of probes. The assay showed stable genotyping results over a 6-fold range of input DNA. This assay generates a comprehensive assessment of a patient's metabolic genotype and is a tool that can provide a more thorough understanding of patient-to-patient variability in pharmacokinetic responses to drugs.Clinical Chemistry 07/2007; 53(7):1222-30. · 7.91 Impact Factor -
Article: Comprehensive assessment of metabolic enzyme and transporter genes using the Affymetrix Targeted Genotyping System.
[show abstract] [hide abstract]
ABSTRACT: The combined effects of multiple polymorphisms in several drug-metabolizing enzyme and transporter genes can contribute to considerable interindividual variation in drug disposition and response. Therefore, it has been of increasing interest to generate scalable, flexible and cost-effective technologies for large-scale genotyping of the drug-metabolizing enzyme and transporter genes. However, the number of drug-metabolizing enzyme and transporter gene variants exceeds the capacity of current technologies to comprehensively assess multiple polymorphisms in a single, multiplexed assay. The Targeted Genotyping System (Affymetrix, CA, USA) provides a solution to this challenge, by combining molecular inversion probe technology with universal microarrays to provide a method that is capable of analyzing thousands of variants in a single reaction, while remaining relatively insensitive to cross-reactivity between reaction components. This review will focus on the Targeted Genotyping System and how this technology was adapted to enable comprehensive analysis of drug-metabolizing enzyme and transporter gene polymorphisms.Pharmacogenomics 04/2007; 8(3):293-305. · 3.97 Impact Factor -
Article: Large-scale characterization of public database SNPs causing non-synonymous changes in three ethnic groups.
[show abstract] [hide abstract]
ABSTRACT: Single nucleotide polymorphisms (SNPs) that lead to non-synonymous changes in proteins may have functional effects and be subject to selection. Hence they are of particular interest in the study of genetic diseases. We have genotyped approximately 28,000 such SNPs in three ethnic populations (the HapMap plates) and ten primate species and analyzed these data for evidence of selection. We find SNPs predicted by PolyPhen to be damaging, have lower allele frequencies, and are particularly likely to be population-specific. We have also grouped SNPs by molecular function or biological process of the associated genes and find evidence that selection may be acting in concert on classes of genes.Human Genetics 04/2006; 119(1-2):75-83. · 5.07 Impact Factor -
Article: Optimal genotype determination in highly multiplexed SNP data.
[show abstract] [hide abstract]
ABSTRACT: High-throughput genotyping technologies that enable large association studies are already available. Tools for genotype determination starting from raw signal intensities need to be automated, robust, and flexible to provide optimal genotype determination given the specific requirements of a study. The key metrics describing the performance of a custom genotyping study are assay conversion, call rate, and genotype accuracy. These three metrics can be traded off against each other. Using the highly multiplexed Molecular Inversion Probe technology as an example, we describe a methodology for identifying the optimal trade-off. The methodology comprises: a robust clustering algorithm and assessment of a large number of data filter sets. The clustering algorithm allows for automatic genotype determination. Many different sets of filters are then applied to the clustered data, and performance metrics resulting from each filter set are calculated. These performance metrics relate to the power of a study and provide a framework to choose the most suitable filter set to the particular study.European Journal of HumanGenetics 03/2006; 14(2):207-15. · 4.40 Impact Factor -
Article: Large-scale characterization of public database SNPs causing non-synonymous changes in three ethnic groups
[show abstract] [hide abstract]
ABSTRACT: Single nucleotide polymorphisms (SNPs) that lead to non-synonymous changes in proteins may have functional effects and be subject to selection. Hence they are of particular interest in the study of genetic diseases. We have genotyped approximately 28,000 such SNPs in three ethnic populations (the HapMap plates) and ten primate species and analyzed these data for evidence of selection. We find SNPs predicted by PolyPhen to be damaging, have lower allele frequencies, and are particularly likely to be population-specific. We have also grouped SNPs by molecular function or biological process of the associated genes and find evidence that selection may be acting in concert on classes of genes.Human Genetics 02/2006; 119(1):75-83. · 5.07 Impact Factor -
Article: Population structure, differential bias and genomic control in a large-scale, case-control association study.
[show abstract] [hide abstract]
ABSTRACT: The main problems in drawing causal inferences from epidemiological case-control studies are confounding by unmeasured extraneous factors, selection bias and differential misclassification of exposure. In genetics the first of these, in the form of population structure, has dominated recent debate. Population structure explained part of the significant +11.2% inflation of test statistics we observed in an analysis of 6,322 nonsynonymous SNPs in 816 cases of type 1 diabetes and 877 population-based controls from Great Britain. The remainder of the inflation resulted from differential bias in genotype scoring between case and control DNA samples, which originated from two laboratories, causing false-positive associations. To avoid excluding SNPs and losing valuable information, we extended the genomic control method by applying a variable downweighting to each SNP.Nature Genetics 12/2005; 37(11):1243-6. · 35.53 Impact Factor -
Article: Positive Selection of a Pre-Expansion CAG Repeat of the Human SCA2 Gene
[show abstract] [hide abstract]
ABSTRACT: Organismic and Evolutionary Biology A region of approximately one megabase of human Chromosome 12 shows extensive linkage disequilibrium in Utah residents with ancestry from northern and western Europe. This strikingly large linkage disequilibrium block was analyzed with statistical and experimental methods to determine whether natural selection could be implicated in shaping the current genome structure. Extended Haplotype Homozygosity and Relative Extended Haplotype Homozygosity analyses on this region mapped a core region of the strongest conserved haplotype to the exon 1 of the Spinocerebellar ataxia type 2 gene (SCA2). Direct DNA sequencing of this region of the SCA2 gene revealed a significant association between a pre-expanded allele [(CAG)[sub]8CAA(CAG)[sub]4CAA(CAG)[sub]8] of CAG repeats within exon 1 and the selected haplotype of the SCA2 gene. A significantly negative Tajima's D value (−2.20, p < 0.01) on this site consistently suggested selection on the CAG repeat. This region was also investigated in the three other populations, none of which showed signs of selection. These results suggest that a recent positive selection of the pre-expansion SCA2 CAG repeat has occurred in Utah residents with European ancestry.PLoS Genetics 09/2005; · 8.69 Impact Factor -
Article: Highly multiplexed molecular inversion probe genotyping: over 10,000 targeted SNPs genotyped in a single tube assay.
[show abstract] [hide abstract]
ABSTRACT: Large-scale genetic studies are highly dependent on efficient and scalable multiplex SNP assays. In this study, we report the development of Molecular Inversion Probe technology with four-color, single array detection, applied to large-scale genotyping of up to 12,000 SNPs per reaction. While generating 38,429 SNP assays using this technology in a population of 30 trios from the Centre d'Etude Polymorphisme Humain family panel as part of the International HapMap project, we established SNP conversion rates of approximately 90% with concordance rates >99.6% and completeness levels >98% for assays multiplexed up to 12,000plex levels. Furthermore, these individual metrics can be "traded off" and, by sacrificing a small fraction of the conversion rate, the accuracy can be increased to very high levels. No loss of performance is seen when scaling from 6,000plex to 12,000plex assays, strongly validating the ability of the technology to suppress cross-reactivity at high multiplex levels. The results of this study demonstrate the suitability of this technology for comprehensive association studies that use targeted SNPs in indirect linkage disequilibrium studies or that directly screen for causative mutations.Genome Research 03/2005; 15(2):269-75. · 13.61 Impact Factor -
Article: Multiplexed genotyping with sequence-tagged molecular inversion probes.
[show abstract] [hide abstract]
ABSTRACT: We report on the development of molecular inversion probe (MIP) genotyping, an efficient technology for large-scale single nucleotide polymorphism (SNP) analysis. This technique uses MIPs to produce inverted sequences, which undergo a unimolecular rearrangement and are then amplified by PCR using common primers and analyzed using universal sequence tag DNA microarrays, resulting in highly specific genotyping. With this technology, multiplex analysis of more than 1,000 probes in a single tube can be done using standard laboratory equipment. Genotypes are generated with a high call rate (95%) and high accuracy (>99%) as determined by independent sequencing.Nature Biotechnology 06/2003; 21(6):673-8. · 23.27 Impact Factor -
Article: A second generation human haplotype map of over 3.1 million SNPs
[show abstract] [hide abstract]
ABSTRACT: We describe the Phase II HapMap, which characterizes over 3.1 million human single nucleotide polymorphisms (SNPs) genotyped in 270 individuals from four geographically diverse populations and includes 25-35% of common SNP variation in the populations surveyed. The map is estimated to capture untyped common variation with an average maximum r(2) of between 0.9 and 0.96 depending on population. We demonstrate that the current generation of commercial genome-wide genotyping products captures common Phase II SNPs with an average maximum r(2) of up to 0.8 in African and up to 0.95 in non-African populations, and that potential gains in power in association studies can be obtained through imputation. These data also reveal novel aspects of the structure of linkage disequilibrium. We show that 10-30% of pairs of individuals within a population share at least one region of extended genetic identity arising from recent ancestry and that up to 1% of all common variants are untaggable, primarily because they lie within recombination hotspots. We show that recombination rates vary systematically around genes and between genes of different function. Finally, we demonstrate increased differentiation at non-synonymous, compared to synonymous, SNPs, resulting from systematic differences in the strength or efficacy of natural selection between populations.Nature.
Top Journals
- Human Genetics (2)
- Nature (2)
- Clinical Chemistry (1)
- Nature (1)
- European Journal of HumanGenetics (1)
Institutions
-
2007
-
Broad Institute of MIT and Harvard
Cambridge, MA, USA -
The Scripps Research Institute
La Jolla, CA, USA
-
-
2003
-
Stanford University
- Stanford Genome Technology Center
Stanford, CA, USA
-