Optimal Methods for Meta-Analysis of Genome-Wide Association Studies

Department of Health Research and Policy, Stanford University, Stanford, California, USA.
Genetic Epidemiology (Impact Factor: 2.6). 11/2011; 35(7):581-91. DOI: 10.1002/gepi.20603
Source: PubMed

ABSTRACT Meta-analysis of genome-wide association studies involves testing single nucleotide polymorphisms (SNPs) using summary statistics that are weighted sums of site-specific score or Wald statistics. This approach avoids having to pool individual-level data. We describe the weights that maximize the power of the summary statistics. For small effect-sizes, any choice of weights yields summary Wald and score statistics with the same power, and the optimal weights are proportional to the square roots of the sites' Fisher information for the SNP's regression coefficient. When SNP effect size is constant across sites, the optimal summary Wald statistic is the well-known inverse-variance-weighted combination of estimated regression coefficients, divided by its standard deviation. We give simple approximations to the optimal weights for various phenotypes, and show that weights proportional to the square roots of study sizes are suboptimal for data from case-control studies with varying case-control ratios, for quantitative trait data when the trait variance differs across sites, for count data when the site-specific mean counts differ, and for survival data with different proportions of failing subjects. Simulations suggest that weights that accommodate intersite variation in imputation error give little power gain compared to those obtained ignoring imputation uncertainties. We note advantages to combining site-specific score statistics, and we show how they can be used to assess effect-size heterogeneity across sites. The utility of the summary score statistic is illustrated by application to a meta-analysis of schizophrenia data in which only site-specific P-values and directions of association are available.

8 Reads
  • [Show abstract] [Hide abstract]
    ABSTRACT: The genetic traits that result in autoimmune diseases represent complicating factors in explicating the molecular and cellular elements of autoimmune responses and how these responses can be overcome or manipulated. This article focuses on the major non-major histocompatibility complex genes that have been found to be linked to autoimmune diseases. A given gene may associate with a number of autoimmune diseases and, conversely, a given disease may link to a number of common autoimmune disease (AD) genes. Collaboration and interaction among genes and the number of diseases that develop and the extensive risk factors shared among ADs further complicate the outcome. This article describes the various relationships between gene regions associated with multiple ADs and the complexity of those relationships.
    Critical Reviews in Immunology 01/2012; 32(3):193-285. · 3.70 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Understanding the effects of gene-environment interaction on complex human diseases or traits in genome-wide association studies (GWAS) can help uncover novel genes and identify environmental hazards that influence only certain genetically susceptible groups. Thus there is a pressing need to develop efficient and powerful interaction analysis methods. In this paper, we propose a novel meta-analysis method of gene-environment interaction, based on meta-regression (MR-M&I). Compared with existing meta-analysis methods, MR-M&I allows for heterogeneity in the environmental factor (E) by dividing the subjects in each study into groups according to the distribution of E. Moreover, it can readily estimate linear or non-linear interactions, and thus it is more generally applicable to different scenarios. We use numerical examples to demonstrate the performance of MR-M&I and compare it with two commonly used methods in current GWAS. The results show that MR-M&I is more powerful than the other methods.
    Genomic Signal Processing and Statistics, (GENSIPS), 2012 IEEE International Workshop on; 01/2012
  • [Show abstract] [Hide abstract]
    ABSTRACT: Abstract In genetic association studies (GAS) as well as in genome-wide association studies (GWAS), the mode of inheritance (dominant, additive and recessive) is usually not known a priori. Assuming an incorrect mode of inheritance may lead to substantial loss of power, whereas on the other hand, testing all possible models may result in an increased type I error rate. The situation is even more complicated in the meta-analysis of GAS or GWAS, in which individual studies are synthesized to derive an overall estimate. Meta-analysis increases the power to detect weak genotype effects, but heterogeneity and incompatibility between the included studies complicate things further. In this review, we present a comprehensive summary of the statistical methods used for robust analysis and genetic model selection in GAS and GWAS. We then discuss the application of such methods in the context of meta-analysis. We describe the theoretical properties of the various methods and the foundations on which they are based. We also present the available software implementations of the described methods. Finally, since only few of the available robust methods have been applied in the meta-analysis setting, we present some simple extensions that allow robust meta-analysis of GAS and GWAS. Possible extensions and proposals for future work are also discussed.
    Statistical Applications in Genetics and Molecular Biology 04/2013; 12(3):1-24. DOI:10.1515/sagmb-2012-0016 · 1.13 Impact Factor
Show more

Preview (2 Sources)

8 Reads
Available from