Genome-Wide Association Analyses Identify SPOCK as a Key Novel Gene Underlying Age at Menarche

School of Medicine, University of Missouri Kansas City, Kansas City, Missouri, United States of America.
PLoS Genetics (Impact Factor: 8.17). 03/2009; 5(3):e1000420. DOI: 10.1371/journal.pgen.1000420
Source: PubMed

ABSTRACT For females, menarche is a most significant physiological event. Age at menarche (AAM) is a trait with high genetic determination and is associated with major complex diseases in women. However, specific genes for AAM variation are largely unknown. To identify genetic factors underlying AAM variation, a genome-wide association study (GWAS) examining about 380,000 SNPs was conducted in 477 Caucasian women. A follow-up replication study was performed to validate our major GWAS findings using two independent Caucasian cohorts with 854 siblings and 762 unrelated subjects, respectively, and one Chinese cohort of 1,387 unrelated subjects--all females. Our GWAS identified a novel gene, SPOCK (Sparc/Osteonectin, CWCV, and Kazal-like domains proteoglycan), which had seven SNPs associated with AAM with genome-wide false discovery rate (FDR) q<0.05. Six most significant SNPs of the gene were selected for validation in three independent replication cohorts. All of the six SNPs were replicated in at least one cohort. In particular, SNPs rs13357391 and rs1859345 were replicated both within and across different ethnic groups in all three cohorts, with p values of 5.09 x 10(-3) and 4.37 x 10(-3), respectively, in the Chinese cohort and combined p values (obtained by Fisher's method) of 5.19 x 10(-5) and 1.02 x 10(-4), respectively, in all three replication cohorts. Interestingly, SPOCK can inhibit activation of MMP-2 (matrix metalloproteinase-2), a key factor promoting endometrial menstrual breakdown and onset of menstrual bleeding. Our findings, together with the functional relevance, strongly supported that the SPOCK gene underlies variation of AAM.

Download full-text


Available from: Albert Z TANG, Aug 07, 2015
  • [Show abstract] [Hide abstract]
    ABSTRACT: Systems genetics is a new discipline based on the transcription mapping, which is also called “genetical genomics”. In recent years, systems genetics has become more practical because of advances in science and technology. Analysis of expression quantitative trait loci (eQTLs) is an emerging technique in which individuals are genotyped across a panel of genetic markers and, simultaneously, phenotyped using DNA microarrays. Depending on eQTL mapping, one can infer the underlying regulatory network responsible for complex diseases or quantitative trait phenotypes. Systems genetics approaches integrate DNA sequence variation, variation in transcript abundance and other molecular phenotypes and variation in organismal phenotypes in a linkage or association mapping population, and allow us to interpret quantitative genetic variation in terms of biologically meaningful causal networks of correlated transcripts. These approaches have been made possible due to the development of massively parallel technologies for quantifying genome-wide levels of transcript abundance. The predictive power of the networks could be enhanced by more systematically integrating protein-protein interactions, protein-DNA interactions, protein-RNA interactions, RNA-RNA interactions, protein state information, methylation state, and interactions with metabolites. Systems genetics research will change the traditional approaches based on reductionism, and allows us to reconsider the living phenomenon and complex disease mechanism. Systems genetics benefits from varied “omics” researches (such as transcriptomics, metabolomics, and phenomics) and the development of bioinformatics tools and mathematical modeling, and will become mature in the near future like many other branches of genetics. Systems genetics is leading researchers to understand genetics systems from holism’s viewpoint, and will open a wide field of vision for genetics researchers in systems biology era.
    Biologia 06/2012; 67(3). DOI:10.2478/s11756-012-0026-9 · 0.70 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Puberty is an important developmental stage during which reproductive capacity is attained. The timing of puberty varies greatly among healthy individuals in the general population and is influenced by both genetic and environmental factors. Although genetic variation is known to influence the normal spectrum of pubertal timing, the specific genes involved remain largely unknown. Genetic analyses have identified a number of genes responsible for rare disorders of pubertal timing such as hypogonadotropic hypogonadism and Kallmann syndrome. Recently, the first loci with common variation reproducibly associated with population variation in the timing of puberty were identified at 6q21 in or near LIN28B and at 9q31.2. However, these two loci explain only a small fraction of the genetic contribution to population variation in pubertal timing, suggesting the need to continue to consider other loci and other types of variants. Here we provide an update of the genes implicated in disorders of puberty, discuss genes and pathways that may be involved in the timing of normal puberty, and suggest additional avenues of investigation to identify genetic regulators of puberty in the general population.
    Molecular and Cellular Endocrinology 02/2010; 324(1-2):21-9. DOI:10.1016/j.mce.2010.01.038 · 4.24 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: As pivotal immune guardians, B cells were found to be directly associated with the onset and development of many smoking-induced diseases. However, the in vivo molecular response of B cells underlying the female cigarette smoking remains unknown. Using the genome-wide Affymetrix HG-133A GeneChip microarray, we firstly compared the gene expression profiles of peripheral circulating B cells between 39 smoking and 40 non-smoking healthy US white women. A total of 125 differential expressed genes were identified in our study, and 75.2% of them were down-regulated in smokers. We further obtained genotypes of 702 single nucleotide polymorphisms in those promising genes and assessed their associations with smoking status. Using a novel multicriteria evaluation model integrating information from microarray and the association studies, several genes were further revealed to play important roles in the response of smoking, including ICOSLG (CD275, inducible T-cell co-stimulator ligand), TCF3 (E2A immunoglobulin enhancer binding factors E12/E47), VCAM1 (CD106, vascular cell adhesion molecule 1), CCR1 (CD191, chemokine C-C motif receptor 1) and IL13 (interleukin 13). The differential expression of ICOSLG (p = 0.0130) and TCF3 (p = 0.0125) genes between the two groups were confirmed by real-time reverse transcription PCR experiment. Our findings support the functional importance of the identified genes in response to the smoking stimulus. This is the first in vivo genome-wide expression study on B cells at today's context of high prevalence rate of smoking for women. Our results highlight the potential usage of integrated analyses for unveiling the novel pathogenesis mechanism and emphasized the significance of B cells in the etiology of smoking-induced disease.
    Immunogenetics 03/2010; 62(4):237-51. DOI:10.1007/s00251-010-0431-6 · 2.49 Impact Factor
Show more