Sampling GWAS subjects from risk populations
Institute of Human Genetics, Klinikum rechts der Isar, TU München, Munich, Germany. Genetic Epidemiology
(Impact Factor: 2.6).
04/2011; 35(3):148-53. DOI: 10.1002/gepi.20562
Power, i.e. sample size, is a crucial issue in genome-wide association studies (GWAS) on disorders generated by a multitude of weak genetic effects. Here, we examine the influence of sampling cases and/or controls from populations that are subjected to an external risk factor (such as smoking or nutritional factors). We use an additive threshold model and derive the necessary sample size as function of the external risk factor's strength and of the sampling scheme. If both cases and controls are sampled from the risk population, a loss of power must be expected. The loss of power (i.e. the increase of the necessary sample size) is even larger if only the cases are sampled from the risk population, whereas the inverse scheme (nonrisk cases and risk controls) provides a gain of power since nonrisk cases are enriched for disease-favouring alleles while risk controls are enriched for protective alleles. For small effect sizes, we derive simple approximations in analytically closed form. A strategy of GWAS sample collection from risk populations minimizing the necessary sample sizes may thus be deduced that generally applies as long as strong gene-environment interactions can be excluded.
Figures in this publication
Data provided are for informational purposes only. Although carefully collected, accuracy cannot be guaranteed. The impact factor represents a rough estimation of the journal's impact factor and does not reflect the actual current impact factor. Publisher conditions are provided by RoMEO. Differing provisions from the publisher's actual policy or licence agreement may be applicable.