GWAMA: software for genome-wide association meta-analysis.

Genetic and Genomic Epidemiology Unit, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK.
BMC Bioinformatics (Impact Factor: 2.67). 01/2010; 11:288. DOI: 10.1186/1471-2105-11-288
Source: PubMed Central

ABSTRACT Despite the recent success of genome-wide association studies in identifying novel loci contributing effects to complex human traits, such as type 2 diabetes and obesity, much of the genetic component of variation in these phenotypes remains unexplained. One way to improving power to detect further novel loci is through meta-analysis of studies from the same population, increasing the sample size over any individual study. Although statistical software analysis packages incorporate routines for meta-analysis, they are ill equipped to meet the challenges of the scale and complexity of data generated in genome-wide association studies.
We have developed flexible, open-source software for the meta-analysis of genome-wide association studies. The software incorporates a variety of error trapping facilities, and provides a range of meta-analysis summary statistics. The software is distributed with scripts that allow simple formatting of files containing the results of each association study and generate graphical summaries of genome-wide meta-analysis results.
The GWAMA (Genome-Wide Association Meta-Analysis) software has been developed to perform meta-analysis of summary statistics generated from genome-wide association studies of dichotomous phenotypes or quantitative traits. Software with source files, documentation and example data files are freely available online at

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Results from numerous linkage and association studies have greatly deepened scientists’ understanding of the genetic basis of many human diseases, yet some important questions remain unanswered. For example, although a large number of disease-associated loci have been identified from genome-wide association studies (GWAS) in the past 10 years, it is challenging to interpret these results as most disease-associated markers have no clear functional roles in disease etiology, and all the identified genomic factors only explain a small portion of disease heritability. With the help of next-generation sequencing (NGS), diverse types of genomic and epigenetic variations can be detected with high accuracy. More importantly, instead of using linkage disequilibrium to detect association signals based on a set of pre-set probes, NGS allows researchers to directly study all the variants in each individual, therefore promises opportunities for identifying functional variants and a more comprehensive dissection of disease heritability. Although the current scale of NGS studies is still limited due to the high cost, the success of several recent studies suggests the great potential for applying NGS in genomic epidemiology, especially as the cost of sequencing continues to drop. In this review, we discuss several pioneer applications of NGS, summarize scientific discoveries for rare and complex diseases, and compare various study designs including targeted sequencing and whole-genome sequencing using population-based and family-based cohorts. Finally, we highlight recent advancements in statistical methods proposed for sequencing analysis, including group-based association tests, meta-analysis techniques, and annotation tools for variant prioritization.
    Frontiers in Genetics 04/2015; 6(149). DOI:10.3389/fgene.2015.00149
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Background Despite moderate heritability estimates for depression-related phenotypes, few robust genetic predictors have been identified. Potential explanations for this discrepancy include the use of phenotypic measures taken from a single time point, rather than integrating information over longer time periods via multiple assessments, and the possibility that genetic risk is shaped by multiple loci with small effects. Methods We developed a 14-year long-term average depression measure based on 14 years of follow-up in the Nurses' Health Study (NHS; N = 6989 women). We estimated polygenic scores (PS) with internal whole-genome scoring (NHS-GWAS-PS). We also constructed PS by applying two external PS weighting algorithms from independent samples, one previously shown to predict depression (GAIN-MDD-PS) and another from the largest genome-wide analysis currently available (PGC-MDD-PS). We assessed the association of all three PS with our long-term average depression phenotype using linear, logistic, and quantile regressions. ResultsIn this study, the three PS approaches explained at most 0.2% of variance in the long-term average phenotype. Quantile regressions indicated PS had larger impacts at higher quantiles of depressive symptoms. Quantile regression coefficients at the 75th percentile were at least 40% larger than at the 25th percentile in all three polygenic scoring algorithms. The interquartile range comparison suggested the effects of PS significantly differed at the 25th and 75th percentiles of the long-term depressive phenotype for the PGC-MDD-PS (P = 0.03), and this difference also reached borderline statistical significance for the GAIN-MDD-PS (P = 0.05). Conclusions Integrating multiple phenotype assessments spanning 14 years and applying different polygenic scoring approaches did not substantially improve genetic prediction of depression. Quantile regressions suggested the effects of PS may be largest at high quantiles of depressive symptom scores, presumably among people with additional, unobserved sources of vulnerability to depression.
    03/2014; 4(2):298-311. DOI:10.1002/brb3.205
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Epilepsy comprises several syndromes, amongst the most common being mesial temporal lobe epilepsy with hippocampal sclerosis. Seizures in mesial temporal lobe epilepsy with hippocampal sclerosis are typically drug-resistant, and mesial temporal lobe epilepsy with hippocampal sclerosis is frequently associated with important co-morbidities, mandating the search for better understanding and treatment. The cause of mesial temporal lobe epilepsy with hippocampal sclerosis is unknown, but there is an association with childhood febrile seizures. Several rarer epilepsies featuring febrile seizures are caused by mutations in SCN1A, which encodes a brain-expressed sodium channel subunit targeted by many anti-epileptic drugs. We undertook a genome-wide association study in 1018 people with mesial temporal lobe epilepsy with hippocampal sclerosis and 7552 control subjects, with validation in an independent sample set comprising 959 people with mesial temporal lobe epilepsy with hippocampal sclerosis and 3591 control subjects. To dissect out variants related to a history of febrile seizures, we tested cases with mesial temporal lobe epilepsy with hippocampal sclerosis with (overall n = 757) and without (overall n = 803) a history of febrile seizures. Meta-analysis revealed a genome-wide significant association for mesial temporal lobe epilepsy with hippocampal sclerosis with febrile seizures at the sodium channel gene cluster on chromosome 2q24.3 [rs7587026, within an intron of the SCN1A gene, P = 3.36 x 10-9, odds ratio (A) = 1.42, 95% confidence interval: 1.26-1.59]. In a cohort of 172 individuals with febrile seizures, who did not develop epilepsy during prospective follow-up to age 13 years, and 6456 controls, no association was found for rs7587026 and febrile seizures. These findings suggest SCN1A involvement in a common epilepsy syndrome, give new direction to biological understanding of mesial temporal lobe epilepsy with hippocampal sclerosis with febrile seizures, and open avenues for investigation of prognostic factors and possible prevention of epilepsy in some children with febrile seizures.
    Brain 01/2013; DOI:10.1093/brain/awt233 · 10.23 Impact Factor

Preview (2 Sources)

1 Download
Available from