Multi-Organ Expression Profiling Uncovers a Gene Module in Coronary Artery Disease Involving Transendothelial Migration of Leukocytes and LIM Domain Binding 2: The Stockholm Atherosclerosis Gene Expression (STAGE) Study

The Computational Medicine Group, Atherosclerosis Research Unit, Department of Medicine, Karolinska Institutet, Stockholm, Sweden.
PLoS Genetics (Impact Factor: 7.53). 12/2009; 5(12):e1000754. DOI: 10.1371/journal.pgen.1000754
Source: PubMed


Environmental exposures filtered through the genetic make-up of each individual alter the transcriptional repertoire in organs central to metabolic homeostasis, thereby affecting arterial lipid accumulation, inflammation, and the development of coronary artery disease (CAD). The primary aim of the Stockholm Atherosclerosis Gene Expression (STAGE) study was to determine whether there are functionally associated genes (rather than individual genes) important for CAD development. To this end, two-way clustering was used on 278 transcriptional profiles of liver, skeletal muscle, and visceral fat (n = 66/tissue) and atherosclerotic and unaffected arterial wall (n = 40/tissue) isolated from CAD patients during coronary artery bypass surgery. The first step, across all mRNA signals (n = 15,042/12,621 RefSeqs/genes) in each tissue, resulted in a total of 60 tissue clusters (n = 3958 genes). In the second step (performed within tissue clusters), one atherosclerotic lesion (n = 49/48) and one visceral fat (n = 59) cluster segregated the patients into two groups that differed in the extent of coronary stenosis (P = 0.008 and P = 0.00015). The associations of these clusters with coronary atherosclerosis were validated by analyzing carotid atherosclerosis expression profiles. Remarkably, in one cluster (n = 55/54) relating to carotid stenosis (P = 0.04), 27 genes in the two clusters relating to coronary stenosis were confirmed (n = 16/17, P<10(-27 and-30)). Genes in the transendothelial migration of leukocytes (TEML) pathway were overrepresented in all three clusters, referred to as the atherosclerosis module (A-module). In a second validation step, using three independent cohorts, the A-module was found to be genetically enriched with CAD risk by 1.8-fold (P<0.004). The transcription co-factor LIM domain binding 2 (LDB2) was identified as a potential high-hierarchy regulator of the A-module, a notion supported by subnetwork analysis, by cellular and lesion expression of LDB2, and by the expression of 13 TEML genes in Ldb2-deficient arterial wall. Thus, the A-module appears to be important for atherosclerosis development and, together with LDB2, merits further attention in CAD research.

Download full-text


Available from: Eric Schadt
  • Source
    • "Moreover, Pi15 expression was previously detected at a precise developmental stage in the chicken during organogenesis (33), suggesting its role in development. Furthermore, Pi15 expression has been detected in the human atherosclerotic aortic wall (12) and in the hyperlipidemic mouse aorta (3). "
    [Show abstract] [Hide abstract]
    ABSTRACT: The inbred Brown Norway (BN) rat develops spontaneous ruptures of the internal elastic lamina (RIEL) of the abdominal aorta (AA) and iliac arteries. Prior studies with crosses of the BN/Orl RJ (susceptible) and LOU/M (resistant) showed the presence of a significant QTL on chromosome 5 and the production of congenic rats proved the involvement of this locus. In this study, we further dissected the above-mentioned QTL by creating a new panel of LOU.BN(chr5) congenic and sub-congenic lines and reduced the locus to 5.2 Mb. Then 1002 HS rats were studied, whose phenotyping revealed a low prevalence and high variability for RIEL. High resolution mapping in the HS panel detected the major locus on chromosome 5 (log P>35) and refined it to 1.4Mb. Subsequently, RNA-seq analysis on AA of BN, congenics and LOU revealed expression differences for only protease inhibitor15 (Pi15) gene and a putative long intergenic noncoding RNA (lincRNA) within the linkage region. The high abundance of lincRNA with respect to reduced Pi15 expression, in conjunction with exertion of longitudinal strain, may be related to RIEL, indicating the potential importance of proteases in biological processes related to defective aortic IEL structure. Similar mechanisms may be involved in aneurysm initiation in the human AA.
    Full-text · Article · May 2014 · Physiological Genomics
  • Source
    • "We next determined if differentially expressed genes showed an overall relationship with cardiovascular disease. To investigate this, we first combined previously compiled lists of CAD-associated genes [30,31]. From the combined list, we identified 3424 genes which showed expression in our RNA-Seq data. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Massively-parallel cDNA sequencing (RNA-Seq) is a new technique that holds great promise for cardiovascular genomics. Here, we used RNA-Seq to study the transcriptomes of matched coronary artery disease cases and controls in the ClinSeq(R) study, using cell lines as tissue surrogates. Lymphoblastoid cell lines (LCLs) from 16 cases and controls representing phenotypic extremes for coronary calcification were cultured and analyzed using RNA-Seq. All cell lines were then independently re-cultured and along with another set of 16 independent cases and controls, were profiled with Affymetrix microarrays to perform a technical validation of the RNA-Seq results. Statistically significant changes (p < 0.05) were detected in 186 transcripts, many of which are expressed at extremely low levels (5-10 copies/cell), which we confirmed through a separate spike-in control RNA-Seq experiment. Next, by fitting a linear model to exon-level RNA-Seq read counts, we detected signals of alternative splicing in 18 transcripts. Finally, we used the RNA-Seq data to identify differential expression (p < 0.0001) in eight previously unannotated regions that may represent novel transcripts. Overall, differentially expressed genes showed strong enrichment (p = 0.0002) for prior association with cardiovascular disease. At the network level, we found evidence for perturbation in pathways involving both cardiovascular system development and function as well as lipid metabolism. We present a pilot study for transcriptome involvement in coronary artery calcification and demonstrate how RNA-Seq analyses using LCLs as a tissue surrogate may yield fruitful results in a clinical sequencing project. In addition to canonical gene expression, we present candidate variants from alternative splicing and novel transcript detection, which have been unexplored in the context of this disease.
    Full-text · Article · Mar 2014 · BMC Genomics
  • Source
    • "To test kruX we provide example analysis scripts and a small anonymised dataset of 2,000 randomly selected genes and markers from 100 randomly selected yeast segregants [12]. Here we describe an application of kruX on a human dataset of 19,610 genes and 530,222 SNP markers measured in 102 whole blood samples from the Stockholm Atherosclerosis Gene Expression (STAGE) study [13]. All SNPs in the dataset had minor allele frequency greater than 5%, no missing values and probability to be in Hardy-Weinberg equilibrium greater than 10 −6 . "
    [Show abstract] [Hide abstract]
    ABSTRACT: The Kruskal-Wallis test is a popular non-parametric statistical test for identifying expression quantitativetrait loci (eQTLs) from genome-wide data due to its robustness against variations in the underlyinggenetic model and expression trait distribution, but testing billions of marker-trait combinationsone-by-one can become computationally prohibitive. We developed kruX, an algorithm implemented in Matlab, Python and R that uses matrix multiplicationsto simultaneously calculate the Kruskal-Wallis test statistic for several millions of marker-traitcombinations at once. KruX is more than ten thousand times faster than computing associations oneby-one on a typical human dataset. We used kruX and a dataset of more than 500k SNPs and 20kexpression traits measured in 102 human blood samples to compare eQTLs detected by the Kruskal-Wallis test to eQTLs detected by the parametric ANOVA and linear model methods. We found that theKruskal-Wallis test is more robust against data outliers and heterogeneous genotype group sizes anddetects a higher proportion of non-linear associations, but is more conservative for calling additivelinear associations. kruX enables the use of robust non-parametric methods for massive eQTL mapping without the needfor a high-performance computing infrastructure and is freely available from
    Full-text · Article · Jan 2014 · BMC Bioinformatics
Show more