Article

[Genome-wide association study on complex diseases: genetic statistical issues].

School of Public Health, Xinjiang Medical University, Urumqi 830054, China.
Hereditas (Beijing) 06/2008; 30(5):543-9. pp.543-9
Source: PubMed

ABSTRACT Since the first genome-wide association study on human age-related macular degeneration was reported by Science journal in 2005, a series of genome-wide association studies have been published on human complex diseases or traits, such as obesity, type 2 diabetes, coronary artery disease, Alzheimer's disease and so on. The study of human genetics has recently undergone a dramatic transition which is called "the first wave of genome-wide association study". Some issues in statistical analysis of genome-wide association studies were reviewed by this paper. First, statistical analysis guidelines, methods and examples for genome-wide association studies of different designs, including unrelated case-control studies, population-based studies, and family-based association studies; second, multiple testing correction of P values, including Bonferroni correction, step-down Bonferroni correction, permutation correction, and the correction based on false discovery rate; third, population stratification and its effect on inference of genotype-phenotype associations. The False Positive Re-port Probability has been successfully applied in a recent genome-wide association study on coronary artery disease to con-trol the population stratification. Although genetic statistical methodology has been greatly developed in control of false positive associations caused by multiple testing or population stratification, it is still not sufficient to achieve the goal. Replicating genotype-phenotype associations is the only way to identify true association between genetic markers and common disease traits. The first wave of genome-wide association studies is producing an impressive list of unexpected associations between genes or chromosomal regions and a broad range of diseases. Traditional statistical techniques are adequate for the analysis and interpretation of these results. However, much more sophisticated methods of statistical analysis are likely to be required as we delve further into the genome in the search for networks of interacting gene variants, or interactions be-tween gene-gene networks and environmental factors. Finally, some useful links about statistical software for genome-wide association studies were provided.

0 0
 · 
1 Bookmark
 · 
83 Views
  • Source
    Article: Complement factor H polymorphism in age-related macular degeneration.
    [show abstract] [hide abstract]
    ABSTRACT: Age-related macular degeneration (AMD) is a major cause of blindness in the elderly. We report a genome-wide screen of 96 cases and 50 controls for polymorphisms associated with AMD. Among 116,204 single-nucleotide polymorphisms genotyped, an intronic and common variant in the complement factor H gene (CFH) is strongly associated with AMD (nominal P value <10(-7)). In individuals homozygous for the risk allele, the likelihood of AMD is increased by a factor of 7.4 (95% confidence interval 2.9 to 19). Resequencing revealed a polymorphism in linkage disequilibrium with the risk allele representing a tyrosine-histidine change at amino acid 402. This polymorphism is in a region of CFH that binds heparin and C-reactive protein. The CFH gene is located on chromosome 1 in a region repeatedly linked to AMD in family-based studies.
    Science 04/2005; 308(5720):385-9. · 31.20 Impact Factor
  • Article: Comment on "A common genetic variant is associated with adult and childhood obesity".
    [show abstract] [hide abstract]
    ABSTRACT: Contrary to the findings of Herbert et al. (Reports, 14 April 2006, p. 279), homozygous carriers of the C allele of the rs7566605 variant near the INSIG2 gene did not exhibit a significantly increased risk for obesity in a large population-based cross-sectional German study. A subgroup analysis, however, revealed that this allele significantly increased the risk for obesity in already overweight individuals.
    Science 02/2007; 315(5809):187; author reply 187. · 31.20 Impact Factor
  • Source
    Article: Genomic Control, a New Approach to Genetic-Based Association Studies
    [show abstract] [hide abstract]
    ABSTRACT: During the past decade, mutations affecting liability to human disease have been discovered at a phenomenal rate, and that rate is increasing. For the most part, however, those diseases have a relatively simple genetic basis. For diseases with a complex genetic and environmental basis, new approaches are needed to pave the way for more rapid discovery of genes affecting liability. One such approach exploits large, population-based samples and large-scale genotyping to evaluate disease/gene associations. A substantial drawback to such samples is the fact that population heterogeneity can induce spurious associations between genes and disease. We describe a method called genomic control (GC), which obviates many of the concerns about population substructure by using the features of the genomes present in the sample to correct for stratification. Two such approaches are now available. The GC approach exploits the fact that population substructure generate “overdispersion” of statistics used to assess association. By testing multiple polymorphisms throughout the genome, only some of which are pertinent to the disease of interest, the degree of overdispersion generated by population substructure can be estimated and taken into account. The other approach, called Structured Association (SA), assumes that the sampled population, while heterogeneous, is composed of subpopulations that are themselves homogeneous. By using multiple polymorphisms throughout the genome, SA probabilistically assigns sampled individuals to these latent subpopulations. We review in detail the overdispersion GC. In addition to outlining the published ideas on this method, we describe several extensions: quantitative trait studies and case–control studies with haplotypes and multiallelic markers. For each study design our goal is to achieve control similar to that obtained for a family-based study, but with the convenience found in a population-based design.
    Theoretical Population Biology 12/2001; · 1.65 Impact Factor

Full-text

View
0 Downloads
Available from

Keywords

broad range
 
chromosomal regions
 
dramatic transition
 
false positive associations
 
False Positive Re-port Probability
 
first genome-wide association study
 
first wave
 
genetic statistical methodology
 
genome-wide association studies
 
genome-wide association study"
 
genotype-phenotype associations
 
human complex diseases
 
interactions be-tween gene-gene networks
 
multiple testing
 
recent genome-wide association study
 
Replicating genotype-phenotype associations
 
statistical software
 
Traditional statistical techniques
 
unexpected associations
 
useful links
 

Wei-Li Yan