E J van den Oord

Virginia Commonwealth University, Ричмонд, Virginia, United States

Are you E J van den Oord?

Claim your profile

Publications (80)553.07 Total impact

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Genome-wide association study meta-analyses have robustly implicated three loci that affect susceptibility for smoking: CHRNA5\CHRNA3\CHRNB4, CHRNB3\CHRNA6 and EGLN2\CYP2A6. Functional follow-up studies of these loci are needed to provide insight into biological mechanisms. However, these efforts have been hampered by a lack of knowledge about the specific causal variant(s) involved. In this study, we prioritized variants in terms of the likelihood they account for the reported associations. We employed targeted capture of the CHRNA5\CHRNA3\CHRNB4, CHRNB3\CHRNA6, and EGLN2\CYP2A6 loci and flanking regions followed by next-generation deep sequencing (mean coverage 78×) to capture genomic variation in 363 individuals. We performed single locus tests to determine if any single variant accounts for the association, and examined if sets of (rare) variants that overlapped with biologically meaningful annotations account for the associations. In total, we investigated 963 variants, of which 71.1% were rare (minor allele frequency < 0.01), 6.02% were insertion/deletions, and 51.7% were catalogued in dbSNP141. The single variant results showed that no variant fully accounts for the association in any region. In the variant set results, CHRNB4 accounts for most of the signal with significant sets consisting of directly damaging variants. CHRNA6 explains most of the signal in the CHRNB3\CHRNA6 locus with significant sets indicating a regulatory role for CHRNA6. Significant sets in CYP2A6 involved directly damaging variants while the significant variant sets suggested a regulatory role for EGLN2. We found that multiple variants implicating multiple processes explain the signal. Some variants can be prioritized for functional follow-up. © The Author 2015. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
    Nicotine & Tobacco Research 08/2015; DOI:10.1093/ntr/ntv166 · 3.30 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Haloperidol is an effective antipsychotic drug for treatment of schizophrenia, but prolonged use can lead to debilitating side effects. To better understand the effects of long-term administration, we measured global metabolic changes in mouse brain following 3 mg/kg/day haloperidol for 28 days. These conditions lead to movement-related side effects in mice akin to those observed in patients after prolonged use. Brain tissue was collected following microwave tissue fixation to arrest metabolism and extracted metabolites were assessed using both liquid and gas chromatography mass spectrometry (MS). Over 300 unique compounds were identified across MS platforms. Haloperidol was found to be present in all test samples and not in controls, indicating experimental validity. Twenty-one compounds differed significantly between test and control groups at the p < 0.05 level. Top compounds were robust to analytical method, also being identified via partial least squares discriminant analysis. Four compounds (sphinganine, N-acetylornithine, leucine and adenosine diphosphate) survived correction for multiple testing in a non-parametric analysis using false discovery rate threshold < 0.1. Pathway analysis of nominally significant compounds (p < 0.05) revealed significant findings for sphingolipid metabolism (p = 0.015) and protein biosynthesis (p = 0.024). Altered sphingolipid metabolism is suggestive of disruptions to myelin. This interpretation is supported by our observation of elevated N-acetyl-aspartyl-glutamate in the haloperidol-treated mice (p = 0.004), a marker previously associated with demyelination. This study further demonstrates the utility of murine neurochemical metabolomics as a method to advance understanding of CNS drug effects.
    Journal of Neuroimmune Pharmacology 04/2015; 10(3). DOI:10.1007/s11481-015-9605-1 · 4.11 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Genome-wide association studies have identified multiple risk variants and loci that show robust association with schizophrenia. Nevertheless, it remains unclear how these variants confer risk to schizophrenia. In addition, the driving force that maintains the schizophrenia risk variants in human gene pool is poorly understood. To investigate whether expression-associated genetic variants contribute to schizophrenia susceptibility, we systematically integrated brain expression quantitative trait loci and genome-wide association data of schizophrenia using Sherlock, a Bayesian statistical framework. Our analyses identified ZNF323 as a schizophrenia risk gene (P = 2.22×10–6). Subsequent analyses confirmed the association of the ZNF323 and its expression-associated single nucleotide polymorphism rs1150711 in independent samples (gene-expression: P = 1.40×10–6; single-marker meta-analysis in the combined discovery and replication sample comprising 44123 individuals: P = 6.85×10−10). We found that the ZNF323 was significantly downregulated in hippocampus and frontal cortex of schizophrenia patients (P = .0038 and P = .0233, respectively). Evidence for pleiotropic effects was detected (association of rs1150711 with lung function and gene expression of ZNF323 in lung: P = 6.62×10–5 and P = 9.00×10–5, respectively) with the risk allele (T allele) for schizophrenia acting as protective allele for lung function. Subsequent population genetics analyses suggest that the risk allele (T) of rs1150711 might have undergone recent positive selection in human population. Our findings suggest that the ZNF323 is a schizophrenia susceptibility gene whose expression may influence schizophrenia risk. Our study also illustrates a possible mechanism for maintaining schizophrenia risk variants in the human gene pool.
    Schizophrenia Bulletin 03/2015; DOI:10.1093/schbul/sbv017 · 8.45 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Converging evidence implicates immune abnormalities in schizophrenia (SCZ), and recent genome-wide association studies (GWAS) have identified immune-related single-nucleotide polymorphisms (SNPs) associated with SCZ. Using the conditional false discovery rate (FDR) approach, we evaluated pleiotropy in SNPs associated with SCZ (n=21 856) and multiple sclerosis (MS) (n=43 879), an inflammatory, demyelinating disease of the central nervous system. Because SCZ and bipolar disorder (BD) show substantial clinical and genetic overlap, we also investigated pleiotropy between BD (n=16 731) and MS. We found significant genetic overlap between SCZ and MS and identified 21 independent loci associated with SCZ, conditioned on association with MS. This enrichment was driven by the major histocompatibility complex (MHC). Importantly, we detected the involvement of the same human leukocyte antigen (HLA) alleles in both SCZ and MS, but with an opposite directionality of effect of associated HLA alleles (that is, MS risk alleles were associated with decreased SCZ risk). In contrast, we found no genetic overlap between BD and MS. Considered together, our findings demonstrate genetic pleiotropy between SCZ and MS and suggest that the MHC signals may differentiate SCZ from BD susceptibility.
    Molecular Psychiatry 02/2015; 20(2):207-214. DOI:10.1038/mp.2013.195 · 14.50 Impact Factor
  • Andrey A Shabalin · Karolina A Aberg · Edwin Jcg van den Oord ·

    Epigenomics 02/2015; 7(1):13-5. DOI:10.2217/epi.14.70 · 4.65 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Anxiety disorders (ADs) are common mental disorders caused by a combination of genetic and environmental factors. Since ADs are highly comorbid with each other, partially due to shared genetic basis, studying AD phenotypes in a coordinated manner may be a powerful strategy for identifying potential genetic loci for ADs. To detect these loci, we performed genome-wide association studies (GWAS) of ADs. In addition, as a complementary approach to single-locus analysis, we also conducted gene- and pathway-based analyses. GWAS data were derived from the control sample of the Molecular Genetics of Schizophrenia (MGS) project (2,540 European American and 849 African American subjects) genotyped on the Affymetrix GeneChip 6.0 array. We applied two phenotypic approaches: (1) categorical case-control comparisons (CC) based upon psychiatric diagnoses, and (2) quantitative phenotypic factor scores (FS) derived from a multivariate analysis combining information across the clinical phenotypes. Linear and logistic models were used to analyse the association with ADs using FS and CC traits, respectively. At the single locus level, no genome-wide significant association was found. A trans-population gene-based meta-analysis across both ethnic subsamples using FS identified three genes (MFAP3L on 4q32.3, NDUFAB1 and PALB2 on 16p12) with genome-wide significance (false discovery rate (FDR] <5%). At the pathway level, several terms such as transcription regulation, cytokine binding, and developmental process were significantly enriched in ADs (FDR <5%). Our approaches studying ADs as quantitative traits and utilizing the full GWAS data may be useful in identifying susceptibility genes and pathways for ADs.
    PLoS ONE 11/2014; 9(11):e112559. DOI:10.1371/journal.pone.0112559 · 3.23 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Genome-wide association studies (GWAS) of schizophrenia have identified multiple risk variants with robust association signals for schizophrenia. However, these variants could explain only a small proportion of schizophrenia heritability. Furthermore, the effect size of these risk variants is relatively small (eg, most of them had an OR less than 1.2), suggesting that additional risk variants may be detected when increasing sample size in analysis. Here, we report the identification of a genome-wide significant schizophrenia risk locus at 22q13.1 by combining 2 large-scale schizophrenia cohort studies. Our meta-analysis revealed that 7 single nucleotide polymorphism (SNPs) on chromosome 22q13.1 reached the genome-wide significance level (P < 5.0×10–8) in the combined samples (a total of 38441 individuals). Among them, SNP rs6001946 had the most significant association with schizophrenia (P = 2.04×10–8). Interestingly, all 7 SNPs are in high linkage disequilibrium and located in the MKL1 gene. Expression analysis showed that MKL1 is highly expressed in human and mouse brains. We further investigated functional links between MKL1 and proteins encoded by other schizophrenia susceptibility genes in the whole human protein interaction network. We found that MKL1 physically interacts with GSK3B, a protein encoded by a well-characterized schizophrenia susceptibility gene. Collectively, our results revealed that genetic variants in MKL1 might confer risk to schizophrenia. Further investigation of the roles of MKL1 in the pathogenesis of schizophrenia is warranted.
    Schizophrenia Bulletin 11/2014; 41(3). DOI:10.1093/schbul/sbu156 · 8.45 Impact Factor

  • Source
    Y Kim · K Xia · R Tao · P Giusti-Rodriguez · V Vladimirov · E van den Oord · P F Sullivan ·
    [Show abstract] [Hide abstract]
    ABSTRACT: Current catalogs of brain expression quantitative trait loci (eQTL) are incomplete and the findings do not replicate well across studies. All existing cortical eQTL studies are small and emphasize the need for a meta-analysis. We performed a meta-analysis of 424 brain samples across five studies to identify regulatory variants influencing gene expression in human cortex. We identified 3584 genes in autosomes and chromosome X with false discovery rate q<0.05 whose expression was significantly associated with DNA sequence variation. Consistent with previous eQTL studies, local regulatory variants tended to occur symmetrically around transcription start sites and the effect was more evident in studies with large sample sizes. In contrast to random SNPs, we observed that significant eQTLs were more likely to be near 5'-untranslated regions and intersect with regulatory features. Permutation-based enrichment analysis revealed that SNPs associated with schizophrenia and bipolar disorder were enriched among brain eQTLs. Genes with significant eQTL evidence were also strongly associated with diseases from OMIM (Online Mendelian Inheritance in Man) and the NHGRI (National Human Genome Research Institute) genome-wide association study catalog. Surprisingly, we found that a large proportion (28%) of ~1000 autosomal genes encoding proteins needed for mitochondrial structure or function were eQTLs (enrichment P-value=1.3 × 10(-)(9)), suggesting a potential role for common genetic variation influencing the robustness of energy supply in brain and a possible role in the etiology of some psychiatric disorders. These systematically generated eQTL information should be a valuable resource in determining the functional mechanisms of brain gene expression and the underlying biology of associations with psychiatric disorders.
    Translational Psychiatry 10/2014; 4(10):e459. DOI:10.1038/tp.2014.96 · 5.62 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Clozapine is a particularly effective antipsychotic medication but its use is curtailed by the risk of clozapine-induced agranulocytosis/granulocytopenia (CIAG), a severe adverse drug reaction occurring in up to 1% of treated individuals. Identifying genetic risk factors for CIAG could enable safer and more widespread use of clozapine. Here we perform the largest and most comprehensive genetic study of CIAG to date by interrogating 163 cases using genome-wide genotyping and whole-exome sequencing. We find that two loci in the major histocompatibility complex are independently associated with CIAG: a single amino acid in HLA-DQB1 (126Q) (P ¼ 4.7 Â 10 À 14 , odds ratio (OR) ¼ 0.19, 95% confidence interval (CI) ¼ 0.12–0.29) and an amino acid change in the extracellular binding pocket of HLA-B (158T) (P ¼ 6.4 Â 10 À 10 , OR ¼ 3.3, 95% CI ¼ 2.3–4.9). These associations dovetail with the roles of these genes in immunogenetic phenotypes and adverse drug responses for other medications, and provide insight into the pathophysiology of CIAG.
    Nature Communications 09/2014; 1718192123252628(27). DOI:10.1038/ncomms5757 · 11.47 Impact Factor
  • Karolina A Aberg · Edwin Jcg van den Oord ·

    Biomarkers in Medicine 06/2014; 8(5):607-11. DOI:10.2217/bmm.14.44 · 2.65 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: IMPORTANCE We investigated the variation in neuropsychological function explained by risk alleles at the psychosis susceptibility gene ZNF804A and its interacting partners using single nucleotide polymorphisms (SNPs), polygenic scores, and epistatic analyses. Of particular importance was the relative contribution of the polygenic score vs epistasis in variation explained. OBJECTIVES To (1) assess the association between SNPs in ZNF804A and the ZNF804A polygenic score with measures of cognition in cases with psychosis and (2) assess whether epistasis within the ZNF804A pathway could explain additional variation above and beyond that explained by the polygenic score. DESIGN, SETTING, AND PARTICIPANTS Patients with psychosis (n = 424) were assessed in areas of cognitive ability impaired in schizophrenia including IQ, memory, attention, and social cognition. We used the Psychiatric GWAS Consortium 1 schizophrenia genome-wide association study to calculate a polygenic score based on identified risk variants within this genetic pathway. Cognitive measures significantly associated with the polygenic score were tested for an epistatic component using a training set (n = 170), which was used to develop linear regression models containing the polygenic score and 2-SNP interactions. The best-fitting models were tested for replication in 2 independent test sets of cases: (1) 170 individuals with schizophrenia or schizoaffective disorder and (2) 84 patients with broad psychosis (including bipolar disorder, major depressive disorder, and other psychosis). MAIN OUTCOMES AND MEASURES Participants completed a neuropsychological assessment battery designed to target the cognitive deficits of schizophrenia including general cognitive function, episodic memory, working memory, attentional control, and social cognition. RESULTS Higher polygenic scores were associated with poorer performance among patients on IQ, memory, and social cognition, explaining 1% to 3% of variation on these scores (range, P = .01 to .03). Using a narrow psychosis training set and independent test sets of narrow phenotype psychosis (schizophrenia and schizoaffective disorder), broad psychosis, and control participants (n = 89), the addition of 2 interaction terms containing 2 SNPs each increased the R2 for spatial working memory strategy in the independent psychosis test sets from 1.2% using the polygenic score only to 4.8% (P = .11 and .001, respectively) but did not explain additional variation in control participants. CONCLUSIONS AND RELEVANCE These data support a role for the ZNF804A pathway in IQ, memory, and social cognition in cases. Furthermore, we showed that epistasis increases the variation explained above the contribution of the polygenic score.
    JAMA Psychiatry 05/2014; DOI:10.1001/jamapsychiatry.2014.528 · 12.01 Impact Factor
  • Source

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Most psychiatric disorders are moderately to highly heritable. The degree to which genetic variation is unique to individual disorders or shared across disorders is unclear. To examine shared genetic etiology, we use genome-wide genotype data from the Psychiatric Genomics Consortium (PGC) for cases and controls in schizophrenia, bipolar disorder, major depressive disorder, autism spectrum disorders (ASD) and attention-deficit/hyperactivity disorder (ADHD). We apply univariate and bivariate methods for the estimation of genetic variation within and covariation between disorders. SNPs explained 17–29% of the variance in liability. The genetic correlation calculated using common SNPs was high between schizophrenia and bipolar disorder (0.68 ± 0.04 s.e.), moderate between schizophrenia and major depressive disorder (0.43 ± 0.06 s.e.), bipolar disorder and major depressive disorder (0.47 ± 0.06 s.e.), and ADHD and major depressive disorder (0.32 ± 0.07 s.e.), low between schizophrenia and ASD (0.16 ± 0.06 s.e.) and non-significant for other pairs of disorders as well as between psychiatric disorders and the negative control of Crohn’s disease. This empirical evidence of shared genetic etiology for psychiatric disorders can inform nosology and encourages the investigation of common pathophysiologies for related disorders.
    Nature Genetics 08/2013; AOP. DOI:10.1038/ng.2711 · 29.35 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Aim: As the primary relevant tissue (brain) for psychiatric disorders is commonly not available, we aimed to investigate whether blood can be used as a proxy in methylation studies on the basis of two models. In the 'signature' model methylation-disease associations occur because a disease-causing factor affected methylation in the blood. In the 'mirror-site' model the methylation status in the blood is correlated with the corresponding disease-causing site in the brain. MATERIALS, METHODS & RESULTS: Methyl-binding domain enrichment and next-generation sequencing of the blood, cortex and hippocampus from four haloperidol-treated and ten untreated C57BL/6 mice revealed high levels of correlation in methylation across tissues. Despite the treatment inducing a large number of methylation changes, this correlation remains high. Conclusion: Our results show that, consistent with the signature model, factors that affect brain processes (i.e., haloperidol) leave biomarker signatures in the blood and, consistent with the mirror-site model, the methylation status of many sites in the blood mirror those in the brain.
    Epigenomics 08/2013; 5(4):367-77. DOI:10.2217/epi.13.36 · 4.65 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Integrating evidence from multiple domains is useful in prioritizing disease candidate genes for subsequent testing. We ranked all known human genes (n = 3819) under linkage peaks in the Irish Study of High-Density Schizophrenia Families using three different evidence domains: 1) a meta-analysis of microarray gene expression results using the Stanley Brain collection, 2) a schizophrenia protein-protein interaction network, and 3) a systematic literature search. Each gene was assigned a domain-specific p-value and ranked after evaluating the evidence within each domain. For comparison to this ranking process, a large-scale candidate gene hypothesis was also tested by including genes with Gene Ontology terms related to neurodevelopment. Subsequently, genotypes of 3725 SNPs in 167 genes from a custom Illumina iSelect array were used to evaluate the top ranked vs. hypothesis selected genes. Seventy-three genes were both highly ranked and involved in neurodevelopment (category 1) while 42 and 52 genes were exclusive to neurodevelopment (category 2) or highly ranked (category 3), respectively. The most significant associations were observed in genes PRKG1, PRKCE, and CNTN4 but no individual SNPs were significant after correction for multiple testing. Comparison of the approaches showed an excess of significant tests using the hypothesis-driven neurodevelopment category. Random selection of similar sized genes from two independent genome-wide association studies (GWAS) of schizophrenia showed the excess was unlikely by chance. In a further meta-analysis of three GWAS datasets, four candidate SNPs reached nominal significance. Although gene ranking using integrated sources of prior information did not enrich for significant results in the current experiment, gene selection using an a priori hypothesis (neurodevelopment) was superior to random selection. As such, further development of gene ranking strategies using more carefully selected sources of information is warranted.
    PLoS ONE 07/2013; 8(7):67776-. DOI:10.1371/journal.pone.0067776 · 3.23 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Schizophrenia (SCZ) is a devastating psychiatric condition. Identifying the specific genetic variants and pathways that increase susceptibility to SCZ is critical to improve disease understanding and address the urgent need for new drug targets. To identify SCZ susceptibility genes. We integrated results from a meta-analysis of 18 genome-wide association studies (GWAS) involving 1,085,772 single-nucleotide polymorphisms (SNPs) and 6 databases that showed significant informativeness for SCZ. The 9380 most promising SNPs were then specifically genotyped in an independent family-based replication study that, after quality control, consisted of 8107 SNPs. Linkage meta-analysis, brain transcriptome meta-analysis, candidate gene database, OMIM, relevant mouse studies, and expression quantitative trait locus databases. We included 11,185 cases and 10,768 control subjects from 6 databases and, after quality control 6298 individuals (including 3286 cases) from 1811 nuclear families. Case-control status for SCZ. Replication results showed a highly significant enrichment of SNPs with small P values. Of the SNPs with replication values of P.01, the proportion of SNPs that had the same direction of effects as in the GWAS meta-analysis was 89% in the combined ancestry group (sign test, P < 2.20 x 10(-16) and 93% in subjects of European ancestry only (P < 2.20 < 10(-16)). Our results supported the major histocompatibility complex region showing a3.7-fold overall enrichment of replication values of P < .01 in subjects from European ancestry. We replicated SNPs in TCF4 (P = 2.53 x 10(-10)) and NOTCH4 (P = 3.16 x 10(-7)) that are among the most robust SCZ findings. More novel findings included POM121L2 (P = 3.51 x 10(-7)), AS3MT (P = 9.01 x 10(-7)), CNNM2 (P = 6.07 = 10(-7)), and NT5C2(P = 4.09 x 10(-7)). To explore the many small effects, we performed pathway analyses. The most significant pathways involved neuronal function (axonal guidance, neuronal systems, and L1 cell adhesion molecule interaction)and the immune system (antigen processing, cell adhesion molecules relevant to T cells, and translocation to immunological synapse). We replicated novel SCZ disease genes and pathogenic pathways. Better understanding the molecular and biological mechanisms involved with schizophrenia may improve disease management and may identify new drug targets.
    Archives of General Psychiatry 06/2013; 70(6):573-81. DOI:10.1001/jamapsychiatry.2013.288 · 14.48 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The importance of including developmental and environmental measures in genetic studies of human pathology is widely acknowledged, but few empirical studies have been published. Barriers include the need for longitudinal studies that cover relevant developmental stages and for samples large enough to deal with the challenge of testing gene-environment-development interaction. A solution to some of these problems is to bring together existing data sets that have the necessary characteristics. As part of the National Institute on Drug Abuse-funded Gene-Environment-Development Initiative, our goal is to identify exactly which genes, which environments, and which developmental transitions together predict the development of drug use and misuse. Four data sets were used of which common characteristics include (1) general population samples, including males and females; (2) repeated measures across adolescence and young adulthood; (3) assessment of nicotine, alcohol, and cannabis use and addiction; (4) measures of family and environmental risk; and (5) consent for genotyping DNA from blood or saliva. After quality controls, 2,962 individuals provided over 15,000 total observations. In the first gene-environment analyses, of alcohol misuse and stressful life events, some significant gene-environment and gene-development effects were identified. We conclude that in some circumstances, already collected data sets can be combined for gene-environment and gene-development analyses. This greatly reduces the cost and time needed for this type of research. However, care must be taken to ensure careful matching across studies and variables.
    Twin Research and Human Genetics 03/2013; 16(02):1-11. DOI:10.1017/thg.2013.6 · 2.30 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Background In methylome-wide association studies (MWAS) there are many possible differences between cases and controls (e.g. related to life style, diet, and medication use) that may affect the methylome and produce false positive findings. An effective approach to control for these confounders is to first capture the major sources of variation in the methylation data and then regress out these components in the association analyses. This approach is, however, computationally very challenging due to the extremely large number of methylation sites in the human genome. Result We introduce MethylPCA that is specifically designed to control for potential confounders in studies where the number of methylation sites is extremely large. MethylPCA offers a complete and flexible data analysis including 1) an adaptive method that performs data reduction prior to PCA by empirically combining methylation data of neighboring sites, 2) an efficient algorithm that performs a principal component analysis (PCA) on the ultra high-dimensional data matrix, and 3) association tests. To accomplish this MethylPCA allows for parallel execution of tasks, uses C++ for CPU and I/O intensive calculations, and stores intermediate results to avoid computing the same statistics multiple times or keeping results in memory. Through simulations and an analysis of a real whole methylome MBD-seq study of 1,500 subjects we show that MethylPCA effectively controls for potential confounders. Conclusions MethylPCA provides users a convenient tool to perform MWAS. The software effectively handles the challenge in memory and speed to perform tasks that would be impossible to accomplish using existing software when millions of sites are interrogated with the sample sizes required for MWAS.
    BMC Bioinformatics 03/2013; 14(1):74. DOI:10.1186/1471-2105-14-74 · 2.58 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Background Methylation studies are a promising complement to genetic studies of DNA sequence. However, detailed prior biological knowledge is typically lacking, so methylome-wide association studies (MWAS) will be critical to detect disease relevant sites. A cost-effective approach involves the next-generation sequencing (NGS) of single-end libraries created from samples that are enriched for methylated DNA fragments. A limitation of single-end libraries is that the fragment size distribution is not observed. This hampers several aspects of the data analysis such as the calculation of enrichment measures that are based on the number of fragments covering the CpGs. Results We developed a non-parametric method that uses isolated CpGs to estimate sample-specific fragment size distributions from the empirical sequencing data. Through simulations we show that our method is highly accurate. While the traditional (extended) read count methods resulted in severely biased coverage estimates and introduces artificial inter-individual differences, through the use of the estimated fragment size distributions we could remove these biases almost entirely. Furthermore, we found correlations of 0.999 between coverage estimates obtained using fragment size distributions that were estimated with our method versus those that were “observed” in paired-end sequencing data. Conclusions We propose a non-parametric method for estimating fragment size distributions that is highly precise and can improve the analysis of cost-effective MWAS studies that sequence single-end libraries created from samples that are enriched for methylated DNA fragments.
    BMC Bioinformatics 02/2013; 14(1):50. DOI:10.1186/1471-2105-14-50 · 2.58 Impact Factor

Publication Stats

4k Citations
553.07 Total Impact Points


  • 2001-2015
    • Virginia Commonwealth University
      • • Center for Biomarker Research and Personalized Medicine,
      • • School of Pharmacy
      • • Department of Psychiatry
      • • Department of Pharmacology and Toxicology
      • • Virginia Institute for Psychiatric and Behavioral Genetics
      Ричмонд, Virginia, United States
  • 2005
    • Washington DC VA Medical Center
      Washington, Washington, D.C., United States
  • 1997-2001
    • Universiteit Utrecht
      • Department of Child and Adolescent Psychiatry
      Utrecht, Utrecht, Netherlands
  • 1996-2001
    • University of Amsterdam
      • Department of Psychonomics
      Amsterdamo, North Holland, Netherlands
  • 1995-2001
    • Erasmus Universiteit Rotterdam
      • Department of Child and Adolescent Psychiatry
      Rotterdam, South Holland, Netherlands
  • 2000
    • London Research Institute
      Londinium, England, United Kingdom
  • 1970
    • University of Oklahoma
      • Department of Psychology
      Oklahoma City, OK, United States