[Show abstract][Hide abstract] ABSTRACT: We performed a Phenome-wide association study (PheWAS) utilizing diverse genotypic and phenotypic data existing across multiple populations in the National Health and Nutrition Examination Surveys (NHANES), conducted by the Centers for Disease Control and Prevention (CDC), and accessed by the Epidemiological Architecture for Genes Linked to Environment (EAGLE) study. We calculated comprehensive tests of association in Genetic NHANES using 80 SNPs and 1,008 phenotypes (grouped into 184 phenotype classes), stratified by race-ethnicity. Genetic NHANES includes three surveys (NHANES III, 1999-2000, and 2001-2002) and three race-ethnicities: non-Hispanic whites (n = 6,634), non-Hispanic blacks (n = 3,458), and Mexican Americans (n = 3,950). We identified 69 PheWAS associations replicating across surveys for the same SNP, phenotype-class, direction of effect, and race-ethnicity at p<0.01, allele frequency >0.01, and sample size >200. Of these 69 PheWAS associations, 39 replicated previously reported SNP-phenotype associations, 9 were related to previously reported associations, and 21 were novel associations. Fourteen results had the same direction of effect across more than one race-ethnicity: one result was novel, 11 replicated previously reported associations, and two were related to previously reported results. Thirteen SNPs showed evidence of pleiotropy. We further explored results with gene-based biological networks, contrasting the direction of effect for pleiotropic associations across phenotypes. One PheWAS result was ABCG2 missense SNP rs2231142, associated with uric acid levels in both non-Hispanic whites and Mexican Americans, protoporphyrin levels in non-Hispanic whites and Mexican Americans, and blood pressure levels in Mexican Americans. Another example was SNP rs1800588 near LIPC, significantly associated with the novel phenotypes of folate levels (Mexican Americans), vitamin E levels (non-Hispanic whites) and triglyceride levels (non-Hispanic whites), and replication for cholesterol levels. The results of this PheWAS show the utility of this approach for exposing more of the complex genetic architecture underlying multiple traits, through generating novel hypotheses for future research.
[Show abstract][Hide abstract] ABSTRACT: -A founder mutation was recently discovered and described as conferring favorable lipid profiles and reduced subclinical atherosclerotic disease in a Pennsylvania Amish population. Preliminary data have suggested that this null mutation APOC3 R19X (rs76353203) is rare in the general population.
[Show abstract][Hide abstract] ABSTRACT: Herpes zoster, commonly referred to as shingles, is caused by the varicella zoster virus (VZV). VZV initially manifests as chicken pox, most commonly in childhood, can remain asymptomatically latent in nerve tissues for many years and often re-emerges as shingles. Although reactivation may be related to immune suppression, aging and female sex, most inter-individual variability in re-emergence risk has not been explained to date. We performed a genome-wide association analyses in 22 981 participants (2280 shingles cases) from the electronic Medical Records and Genomics Network. Using Cox survival and logistic regression, we identified a genomic region in the combined and European ancestry groups that has an age of onset effect reaching genome-wide significance (P>1.0 × 10(-8)). This region tags the non-coding gene HCP5 (HLA Complex P5) in the major histocompatibility complex. This gene is an endogenous retrovirus and likely influences viral activity through regulatory functions. Variants in this genetic region are known to be associated with delay in development of AIDS in people infected by HIV. Our study provides further suggestion that this region may have a critical role in viral suppression and could potentially harbor a clinically actionable variant for the shingles vaccine.Genes and Immunity advance online publication, 9 October 2014; doi:10.1038/gene.2014.51.
[Show abstract][Hide abstract] ABSTRACT: Using genome-wide data from 253,288 individuals, we identified 697 variants at genome-wide significance that together explained one-fifth of the heritability for adult height. By testing different numbers of variants in independent studies, we show that the most strongly associated approximately 2,000, approximately 3,700 and approximately 9,500 SNPs explained approximately 21%, approximately 24% and approximately 29% of phenotypic variance. Furthermore, all common variants together captured 60% of heritability. The 697 variants clustered in 423 loci were enriched for genes, pathways and tissue types known to be involved in growth and together implicated genes and pathways not highlighted in earlier efforts, such as signaling by fibroblast growth factors, WNT/beta-catenin and chondroitin sulfate-related genes. We identified several genes and pathways not previously connected with human skeletal growth, including mTOR, osteoglycin and binding of hyaluronic acid. Our results indicate a genetic architecture for human height that is characterized by a very large but finite number (thousands) of causal variants.
[Show abstract][Hide abstract] ABSTRACT: Purpose:Substantial progress has been made in identifying susceptibility variants for age-related macular degeneration (AMD) in European populations; however, few studies have been conducted to understand the role these variants play in AMD risk in diverse populations. The present study aims to examine AMD risk across diverse populations in known and suspected AMD complement factor and lipid-related loci. Methods:Targeted genotyping was performed across study sites for AMD and lipid trait-associated SNPs. Genetic association tests were performed at individual sites and then meta-analyzed using logistic regression assuming an additive genetic model stratified by self-described race/ethnicity to determine risk of any AMD. Participants included cases with early or late AMD and controls with no signs of AMD as determined by fundus photography. Populations included in this study were European Americans, African Americans, Mexican Americans, and Singaporeans from the Population Architecture using Genomics and Epidemiology (PAGE) study. Results:AMD index variants rs1061170 (CFH) and rs10490924 (ARMS2) were associated with AMD at p=3.05x10-8 and p=6.36x10-6, respectively, in European Americans. In general, none of the major AMD index variants generalized to our non-European populations with the exception of rs10490924 in Mexican Americans (p<0.05). Four lipid-associated SNPs (LPL rs328, TRIB1 rs6987702, CETP rs1800775, and KCTD10/MVK rs2338104) were associated with AMD in African Americans and Mexican Americans at a liberal significance threshold (p<0.05). Conclusions:While most associations did not generalize in the non-European populations, variants within lipid-related genes were found to be associated with AMD. This study highlights the need for larger well-powered studies in non-European populations.
[Show abstract][Hide abstract] ABSTRACT: Sharing genomic data is crucial to support scientific investigation such as genome-wide association studies (GWAS). However, recent investigations suggest the privacy of the individual participants in these studies can be compromised, leading to serious concerns and consequences, such as overly restricted access to data.
[Show abstract][Hide abstract] ABSTRACT: Background: Multiple primary cancers account for ~16% of all incident cancers in the U.S.. While genome-wide association studies (GWAS) have identified many common genetic variants associated with various cancer sites, no study has examined the association of these genetic variants with risk of multiple primary cancers (MPC). Methods: As part of the NHGRI Population Architecture using Genomics and Epidemiology (PAGE) study, we used data from the Multiethnic Cohort and Women's Health Initiative. Incident MPC (IMPC) cases (n=1,385) were defined as participants diagnosed with >1 incident cancers after cohort entry. Participants diagnosed with only one incident cancer after cohort entry with follow-up equal to or longer than IMPC cases served as controls (single-index cancer controls; n= 9,626). Fixed-effects meta-analyses of unconditional logistic regression analyses were used to evaluate the association between cancer risk variants and IMPC risk. To account for multiple comparisons, we used the false positive report probability (FPRP) to determine statistical significance. Results: A nicotine dependence-associated and lung cancer variant, CHRNA3 rs578776 (OR=1.16, 95% CI=1.05-1.26; p=0.004) and two breast cancer variants, EMBP1 rs11249433 and TOX3 rs3803662 (OR=1.16, 95% CI=1.04-1.28; p=0.005 and OR=1.13, 95% CI=1.03-1.23; p=0.006) were significantly associated with risk of IMPC. The associations for rs578776 and rs11249433 remained (p<0.05) after removing subjects who had lung or breast cancers, respectively (p-values≤0.046). These associations did not show significant heterogeneity by smoking status (p-heterogeneity≥0.53). Conclusions: Our study has identified rs578776 and rs11249433 as risk variants for IMPC. Impact: These findings may help to identify genetic regions associated with IMPC risk.
[Show abstract][Hide abstract] ABSTRACT: Phenome-wide association studies (PheWAS) have demonstrated utility in validating genetic associations derived from traditional genetic studies as well as identifying novel genetic associations. Here we used an electronic health record (EHR)-based PheWAS to explore pleiotropy of genetic variants in the fat mass and obesity associated gene (FTO), some of which have been previously associated with obesity and type 2 diabetes (T2D). We used a population of 10,487 individuals of European ancestry with genome-wide genotyping from the Electronic Medical Records and Genomics (eMERGE) Network and another population of 13,711 individuals of European ancestry from the BioVU DNA biobank at Vanderbilt genotyped using Illumina HumanExome BeadChip. A meta-analysis of the two study populations replicated the well-described associations between FTO variants and obesity (odds ratio [OR] = 1.25, 95% Confidence Interval = 1.11-1.24, p = 2.10 × 10(-9)) and FTO variants and T2D (OR = 1.14, 95% CI = 1.08-1.21, p = 2.34 × 10(-6)). The meta-analysis also demonstrated that FTO variant rs8050136 was significantly associated with sleep apnea (OR = 1.14, 95% CI = 1.07-1.22, p = 3.33 × 10(-5)); however, the association was attenuated after adjustment for body mass index (BMI). Novel phenotype associations with obesity-associated FTO variants included fibrocystic breast disease (rs9941349, OR = 0.81, 95% CI = 0.74-0.91, p = 5.41 × 10(-5)) and trends toward associations with non-alcoholic liver disease and gram-positive bacterial infections. FTO variants not associated with obesity demonstrated other potential disease associations including non-inflammatory disorders of the cervix and chronic periodontitis. These results suggest that genetic variants in FTO may have pleiotropic associations, some of which are not mediated by obesity.
[Show abstract][Hide abstract] ABSTRACT: Type 2 diabetes (T2D) is more prevalent in African Americans than in Europeans. However, little is known about the genetic risk in African Americans despite the recent identification of more than 70 T2D loci primarily by genome-wide association studies (GWAS) in individuals of European ancestry. In order to investigate the genetic architecture of T2D in African Americans, the MEta-analysis of type 2 DIabetes in African Americans (MEDIA) Consortium examined 17 GWAS on T2D comprising 8,284 cases and 15,543 controls in African Americans in stage 1 analysis. Single nucleotide polymorphisms (SNPs) association analysis was conducted in each study under the additive model after adjustment for age, sex, study site, and principal components. Meta-analysis of approximately 2.6 million genotyped and imputed SNPs in all studies was conducted using an inverse variance-weighted fixed effect model. Replications were performed to follow up 21 loci in up to 6,061 cases and 5,483 controls in African Americans, and 8,130 cases and 38,987 controls of European ancestry. We identified three known loci (TCF7L2, HMGA2 and KCNQ1) and two novel loci (HLA-B and INS-IGF2) at genome-wide significance (4.15×10-94<P<5×10-8, odds ratio (OR) = 1.09 to 1.36). Fine-mapping revealed that 88 of 158 previously identified T2D or glucose homeostasis loci demonstrated nominal to highly significant association (2.2×10-23 < locus-wide P<0.05). These novel and previously identified loci yielded a sibling relative risk of 1.19, explaining 17.5% of the phenotypic variance of T2D on the liability scale in African Americans. Overall, this study identified two novel susceptibility loci for T2D in African Americans. A substantial number of previously reported loci are transferable to African Americans after accounting for linkage disequilibrium, enabling fine mapping of causal variants in trans-ethnic meta-analysis studies.
[Show abstract][Hide abstract] ABSTRACT: Both environmental and genetic factors impact type 2 diabetes (T2D). To identify such modifiers, we genotyped 15 T2D-associated variants from genome-wide association studies (GWAS) in 6,414 non-Hispanic whites, 3,073 non-Hispanic blacks, and 3,633 Mexican American participants from the National Health and Nutrition Examination Surveys (NHANES) and evaluated interactions between these variants and carbohydrate intake and fiber intake.
[Show abstract][Hide abstract] ABSTRACT: Genome-wide association studies (GWAS) have identified a large number of cancer-associated single nucleotide polymorphisms (SNPs), several of which have been associated with multiple cancer sites suggesting pleiotropic effects and shared biological mechanisms across some cancers. We hypothesized that SNPs associated with other cancers may be additionally associated with endometrial cancer. We examined 213 SNPs previously associated with 14 other cancers for their associations with endometrial cancer in 3,758 endometrial cancer cases and 5,966 controls of European ancestry from two consortia: Population Architecture Using Genomics and Epidemiology (PAGE) and the Epidemiology of Endometrial Cancer Consortium (E2C2). Study-specific logistic regression estimates adjusted for age, body mass index, and the most significant principal components of genetic ancestry were combined using fixed-effect meta-analysis to evaluate the association between each SNP and endometrial cancer risk. A Bonferroni-corrected P-value of 2.35x10(-4) was used to determine statistical significance of the associations. SNP rs7679673, ~6.3 kb upstream of TET2 and previously reported to be associated with prostate cancer risk, was associated with endometrial cancer risk in the direction opposite to that for prostate cancer [meta-analysis odds ratio (OR)=0.87 (per copy of the C allele), 95% Confidence Interval (CI)=0.81, 0.93; P=7.37x10(-5)] with no evidence of heterogeneity across studies (P-heterogeneity=0.66). This pleiotropic analysis is the first to suggest TET2 as a susceptibility locus for endometrial cancer.
[Show abstract][Hide abstract] ABSTRACT: Mitochondria play a critical role in the cell and have DNA independent of the nuclear genome. There is much evidence that mitochondrial DNA (mtDNA) variation plays a role in human health and disease, however, this area of investigation has lagged behind research into the role of nuclear genetic variation on complex traits and phenotypic outcomes. Phenome-wide association studies (PheWAS) investigate the association between a wide range of traits and genetic variation. To date, this approach has not been used to investigate the relationship between mtDNA variants and phenotypic variation. Herein, we describe the development of a PheWAS framework for mtDNA variants (mt-PheWAS). Using the Metabochip custom genotyping array, nuclear and mitochondrial DNA variants were genotyped in 11,519 African Americans from the Vanderbilt University biorepository, BioVU. We employed both polygenic modeling and association testing with mitochondrial single nucleotide polymorphisms (mtSNPs) to explore the relationship between mtDNA variants and a group of eight cardiovascular-related traits obtained from de-identified electronic medical records within BioVU.
Using polygenic modeling we found evidence for an effect of mtDNA variation on total cholesterol and type 2 diabetes (T2D). After performing comprehensive mitochondrial single SNP associations, we identified an increased number of single mtSNP associations with total cholesterol and T2D compared to the other phenotypes examined, which did not have more significantly associated SNPs than would be expected by chance. Among the mtSNPs significantly associated with T2D we identified variant mt16189, an association previously reported only in Asian and European-descent populations.
Our replication of previous findings and identification of novel associations from this initial study suggest that our mt-PheWAS approach is robust for investigating the relationship between mitochondrial genetic variation and a range of phenotypes, providing a framework for future mt-PheWAS.
[Show abstract][Hide abstract] ABSTRACT: Genome-wide association studies have identified hundreds of genetic variants associated with specific cancers. A few of these risk regions have been associated with more than one cancer site; however, a systematic evaluation of the associations between risk variants for other cancers and lung cancer risk has yet to be performed.
We included 18023 patients with lung cancer and 60543 control subjects from two consortia, Population Architecture using Genomics and Epidemiology (PAGE) and Transdisciplinary Research in Cancer of the Lung (TRICL). We examined 165 single-nucleotide polymorphisms (SNPs) that were previously associated with at least one of 16 non-lung cancer sites. Study-specific logistic regression results underwent meta-analysis, and associations were also examined by race/ethnicity, histological cell type, sex, and smoking status. A Bonferroni-corrected P value of 2.5×10(-5) was used to assign statistical significance.
The breast cancer SNP LSP1 rs3817198 was associated with an increased risk of lung cancer (odds ratio [OR] = 1.10; 95% confidence interval [CI] = 1.05 to 1.14; P = 2.8×10(-6)). This association was strongest for women with adenocarcinoma (P = 1.2×10(-4)) and not statistically significant in men (P = .14) with this cell type (P het by sex = .10). Two glioma risk variants, TERT rs2853676 and CDKN2BAS1 rs4977756, which are located in regions previously associated with lung cancer, were associated with increased risk of adenocarcinoma (OR = 1.16; 95% CI = 1.10 to 1.22; P = 1.1×10(-8)) and squamous cell carcinoma (OR = 1.13; CI = 1.07 to 1.19; P = 2.5×10(-5)), respectively.
Our findings demonstrate a novel pleiotropic association between the breast cancer LSP1 risk region marked by variant rs3817198 and lung cancer risk.
[Show abstract][Hide abstract] ABSTRACT: -C-reactive protein (CRP) is a biomarker of inflammation. Genome-wide association studies (GWAS) have identified single nucleotide polymorphisms (SNPs) associated with CRP concentrations and inflammation-related traits such as cardiovascular disease, type 2 diabetes, and obesity. We aimed to replicate previous CRP-SNP associations, assess whether these associations generalize to additional race/ethnicity groups, and evaluate inflammation-related SNPs for a potentially pleiotropic association with CRP.
-We selected and analyzed 16 CRP-associated and 250 inflammation-related GWAS SNPs among 40,473 African American, American Indian, Asian/Pacific Islander, European American, and Hispanic participants from 7 studies collaborating in the Population Architecture using Genomics and Epidemiology (PAGE) study. Fixed-effect meta-analyses combined study-specific race/ethnicity-stratified linear regression estimates to evaluate the association between each SNP and high-sensitivity CRP. Overall, 18 SNPs in 8 loci were significantly associated with CRP (Bonferroni-corrected p<3.1x10(-3) for replication, p<2.0x10(-4) for pleiotropy): Seven of these were specific to European Americans, while 9 additionally generalized to African Americans (1), Hispanics (5), or both (3); 1 SNP was seen only in African Americans and Hispanics. Two SNPs in the CELSR2/PSRC1/SORT1 locus showed a potentially novel association with CRP: rs599839 (p=2.0x10(-6)) and rs646776 (p=3.1x10(-5)).
-We replicated 16 SNP-CRP associations, 10 of which generalized to African Americans and/or Hispanics. We also identified potentially novel pleiotropic associations with CRP for two SNPs previously associated with coronary artery disease and LDL cholesterol. These findings demonstrate the benefit of evaluating genotype-phenotype associations in multiple race/ethnicity groups, and of looking for pleiotropic relationships among SNPs previously associated with related phenotypes.
[Show abstract][Hide abstract] ABSTRACT: Mitochondrial DNA (mtDNA) haplogroups are valuable for investigations in forensic science, molecular anthropology, and human genetics. In this study, we developed a custom panel of 61 mtDNA markers for high-throughput classification of European, African, and Native American/Asian mitochondrial haplogroup lineages. Using these mtDNA markers, we constructed a mitochondrial haplogroup classification tree and classified 18,832 participants from the National Health and Nutrition Examination Surveys (NHANES). To our knowledge, this is the largest study to date characterizing mitochondrial haplogroups in a population-based sample from the United States, and the first study characterizing mitochondrial haplogroup distributions in self-identified Mexican Americans separately from Hispanic Americans of other descent. We observed clear differences in the distribution of maternal genetic ancestry consistent with proposed admixture models for these subpopulations, underscoring the genetic heterogeneity of the United States Hispanic population. The mitochondrial haplogroup distributions in the other self-identified racial/ethnic groups within NHANES were largely comparable to previous studies. Mitochondrial haplogroup classification was highly concordant with self-identified race/ethnicity (SIRE) in non-Hispanic whites (94.8 %), but was considerably lower in admixed populations including non-Hispanic blacks (88.3 %), Mexican Americans (81.8 %), and other Hispanics (61.6 %), suggesting SIRE does not accurately reflect maternal genetic ancestry, particularly in populations with greater proportions of admixture. Thus, it is important to consider inconsistencies between SIRE and genetic ancestry when performing genetic association studies. The mitochondrial haplogroup data that we have generated, coupled with the epidemiologic variables in NHANES, is a valuable resource for future studies investigating the contribution of mtDNA variation to human health and disease.
[Show abstract][Hide abstract] ABSTRACT: Calcineurin-inhibitors CI are immunosuppressive agents prescribed to patients after solid organ transplant to prevent rejection. Although these drugs have been transformative for allograft survival, long-term use is complicated by side effects including nephrotoxicity. Given the narrow therapeutic index of CI, therapeutic drug monitoring is used to prevent acute rejection from underdosing and acute toxicity from overdosing, but drug monitoring does not alleviate long-term side effects. Patients on calcineurin-inhibitors for long periods almost universally experience declines in renal function, and a subpopulation of transplant recipients ultimately develop chronic kidney disease that may progress to end stage renal disease attributable to calcineurin inhibitor toxicity (CNIT). Pharmacogenomics has the potential to identify patients who are at high risk for developing advanced chronic kidney disease caused by CNIT and providing them with existing alternate immunosuppressive therapy. In this study we utilized BioVU, Vanderbilt University Medical Center's DNA biorepository linked to de-identified electronic medical records to identify a cohort of 115 heart transplant recipients prescribed calcineurin-inhibitors to identify genetic risk factors for CNIT We identified 37 cases of nephrotoxicity in our cohort, defining nephrotoxicity as a monthly median estimated glomerular filtration rate (eGFR) <30 mL/min/1.73m2 at least six months post-transplant for at least three consecutive months. All heart transplant patients were genotyped on the Illumina ADME Core Panel, a pharmacogenomic genotyping platform that assays 184 variants across 34 genes. In Cox regression analysis adjusting for age at transplant, pre-transplant chronic kidney disease, pre-transplant diabetes, and the three most significant principal components (PCAs), we did not identify any markers that met our multiple-testing threshold. As a secondary analysis we also modeled post-transplant eGFR directly with linear mixed models adjusted for age at transplant, cyclosporine use, median BMI, and the three most significant principal components. While no SNPs met our threshold for significance, a SNP previously identified in genetic studies of the dosing of tacrolimus CYP34A rs776746, replicated in an adjusted analysis at an uncorrected p-value of 0.02 (coeff(S.E.) = 14.60(6.41)). While larger independent studies will be required to further validate this finding, this study underscores the EMRs usefulness as a resource for longitudinal pharmacogenetic study designs.
Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing. 01/2014; 19:253-64.
[Show abstract][Hide abstract] ABSTRACT: Postoperative atrial fibrillation (PoAF) after cardiac surgery is common and associated with increased morbidity and mortality. Increased sympathetic activation after surgery contributes to PoAF, and β-blockers are the first-line recommendation for its prevention. We examined the hypothesis that common functional genetic variants in the β1-adrenoreceptor, the mediator of cardiac sympathetic activation and drug target of β-blockers, are associated with the risk for PoAF and with the protective effect of β-blockers.
In a prospective cohort study, we studied 947 adult European Americans who underwent cardiac surgery at Vanderbilt University between 1999 and 2005. We genotyped 2 variants in the β1-adrenoreceptor, rs1801253 (Arg389Gly) and rs1801252 (Ser49Gly), and used logistic regression to examine the association between genotypes and PoAF occurring within 14 days after surgery, before and after adjustment for demographic and clinical covariates.
Postoperative atrial fibrillation occurred in 239 patients (25.2%) and was associated with rs1801253 genotype (adjusted P = .008), with Gly389Gly having an odds ratio of 2.63 (95% CI 1.42-4.89) for PoAF compared to the common Arg389Arg (P = .002). In a predefined subgroup analysis, this association appeared to be stronger among patients without β-blocker prophylaxis (adjusted odds ratio 7.00, 95% CI 1.82-26.96, P = .005) compared to patients with β-blocker prophylaxis, among whom the association between rs1801253 genotype and PoAF was not statistically significant (adjusted P = .11).
The Gly389 variant in the β1-adrenoreceptor is associated with PoAF, and this association appears to be modulated by β-blocker therapy. Future studies of the association of other adrenergic pathway genes with PoAF will be of interest.
American heart journal 01/2014; 167(1):101-108.e1. · 4.65 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Measures of cardiac structure and function are important human phenotypes that are associated with a range of clinical outcomes. Studying these traits in large populations can be time consuming and costly. Utilizing data from large electronic medical records (EMRs) is one possible solution to this problem. We describe the extraction and filtering of quantitative transthoracic echocardiographic data from the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study, a large, racially diverse, EMR-based cohort (n = 15,863).
Journal of clinical bioinformatics. 01/2014; 4:12.