Genome-wide association scan identifies a risk locus for preeclampsia on 2q14, near the inhibin, beta B gene.

Department of Genetics, Texas Biomedical Research Institute, San Antonio, Texas, United States of America.
PLoS ONE (Impact Factor: 3.53). 01/2012; 7(3):e33666. DOI: 10.1371/journal.pone.0033666
Source: PubMed

ABSTRACT Elucidating the genetic architecture of preeclampsia is a major goal in obstetric medicine. We have performed a genome-wide association study (GWAS) for preeclampsia in unrelated Australian individuals of Caucasian ancestry using the Illumina OmniExpress-12 BeadChip to successfully genotype 648,175 SNPs in 538 preeclampsia cases and 540 normal pregnancy controls. Two SNP associations (rs7579169, p = 3.58×10(-7), OR = 1.57; rs12711941, p = 4.26×10(-7), OR = 1.56) satisfied our genome-wide significance threshold (modified Bonferroni p<5.11×10(-7)). These SNPs reside in an intergenic region less than 15 kb downstream from the 3' terminus of the Inhibin, beta B (INHBB) gene on 2q14.2. They are in linkage disequilibrium (LD) with each other (r(2) = 0.92), but not (r(2)<0.80) with any other genotyped SNP ±250 kb. DNA re-sequencing in and around the INHBB structural gene identified an additional 25 variants. Of the 21 variants that we successfully genotyped back in the case-control cohort the most significant association observed was for a third intergenic SNP (rs7576192, p = 1.48×10(-7), OR = 1.59) in strong LD with the two significant GWAS SNPs (r(2)>0.92). We attempted to provide evidence of a putative regulatory role for these SNPs using bioinformatic analyses and found that they all reside within regions of low sequence conservation and/or low complexity, suggesting functional importance is low. We also explored the mRNA expression in decidua of genes ±500 kb of INHBB and found a nominally significant correlation between a transcript encoded by the EPB41L5 gene, ∼250 kb centromeric to INHBB, and preeclampsia (p = 0.03). We were unable to replicate the associations shown by the significant GWAS SNPs in case-control cohorts from Norway and Finland, leading us to conclude that it is more likely that these SNPs are in LD with as yet unidentified causal variant(s).

  • [Show abstract] [Hide abstract]
    ABSTRACT: Preeclampsia encompasses multiple conditions of varying severity. We examined the recurrence and familial aggregation of preeclampsia by timing of onset, which is a marker for severity. We ascertained personal and family histories of preeclampsia for women who delivered live singletons in Denmark in 1978-2008 (almost 1.4 million pregnancies). Using log-linear binomial regression, we estimated risk ratios for the associations between personal and family histories of preeclampsia and the risk of early-onset (before 34 weeks of gestation, which is typically the most severe), intermediate-onset (at 34-36 weeks of gestation), and late-onset (after 36 weeks of gestation) preeclampsia. Previous early-, intermediate-, or late-onset preeclampsia increased the risk of recurrent preeclampsia with the same timing of onset 25.2 times (95% confidence interval (CI): 21.8, 29.1), 19.7 times (95% CI: 17.0, 22.8), and 10.3 times (95% CI: 9.85, 10.9), respectively, compared with having no such history. Preeclampsia in a woman's family was associated with a 24%-163% increase in preeclampsia risk, with the strongest associations for early- and intermediate-onset preeclampsia in female relatives. Preeclampsia in the man's family did not affect a woman's risk of early-onset preeclampsia and was only weakly associated with her risks of intermediate- and late-onset preeclampsia. Early-onset preeclampsia appears to have the largest genetic component, whereas environmental factors likely contribute most to late-onset preeclampsia. The role of paternal genes in the etiology of preeclampsia appears to be limited.
    American journal of epidemiology 09/2013; · 5.59 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Systematic data management and controlled data sharing aim at increasing reproducibility, reducing redundancy in work, and providing a way to efficiently locate complementing or contradicting information. One method of achieving this is collecting data in a central repository or in a location that is part of a federated system and providing interfaces to the data. However, certain data, such as data from biobanks or clinical studies, may, for legal and privacy reasons, often not be stored in public repositories. Instead, we describe a metadata cataloguing system and a software suite for reporting the presence of data from the life sciences domain. The system stores three types of metadata: file information, file provenance and data lineage, and content descriptions. Our software suite includes both graphical and command line interfaces that allow users to report and tag files with these different metadata types. Importantly, the files remain in their original locations with their existing access-control mechanisms in place, while our system provides descriptions of their contents and relationships. Our system and software suite thereby provide a common framework for cataloguing and sharing both public and private data. Database URL:
    Database The Journal of Biological Databases and Curation 01/2014; 2014:bau027. · 4.20 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Genetic association studies, in particular the genome-wide association study (GWAS) design, have provided a wealth of novel insights into the aetiology of a wide range of human diseases and traits, in particular cardiovascular diseases and lipid biomarkers. The next challenge consists of understanding the molecular basis of these associations. The integration of multiple association datasets, including gene expression datasets, can contribute to this goal. We have developed a novel statistical methodology to assess whether two association signals are consistent with a shared causal variant. An application is the integration of disease scans with expression quantitative trait locus (eQTL) studies, but any pair of GWAS datasets can be integrated in this framework. We demonstrate the value of the approach by re-analysing a gene expression dataset in 966 liver samples with a published meta-analysis of lipid traits including >100,000 individuals of European ancestry. Combining all lipid biomarkers, our re-analysis supported 26 out of 38 reported colocalisation results with eQTLs and identified 14 new colocalisation results, hence highlighting the value of a formal statistical test. In three cases of reported eQTL-lipid pairs (SYPL2, IFT172, TBKBP1) for which our analysis suggests that the eQTL pattern is not consistent with the lipid association, we identify alternative colocalisation results with SORT1, GCKR, and KPNB1, indicating that these genes are more likely to be causal in these genomic intervals. A key feature of the method is the ability to derive the output statistics from single SNP summary statistics, hence making it possible to perform systematic meta-analysis type comparisons across multiple GWAS datasets (implemented online at Our methodology provides information about candidate causal genes in associated intervals and has direct implications for the understanding of complex diseases as well as the design of drugs to target disease pathways.
    PLoS Genetics 05/2014; 10(5):e1004383. · 8.52 Impact Factor

Full-text (2 Sources)

Available from
May 17, 2014