Article

Disease signatures are robust across tissues and experiments.

Stanford Center for Biomedical Informatics Research, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA.
Molecular Systems Biology (Impact Factor: 11.34). 01/2009; 5:307. DOI: 10.1038/msb.2009.66
Source: PubMed

ABSTRACT Meta-analyses combining gene expression microarray experiments offer new insights into the molecular pathophysiology of disease not evident from individual experiments. Although the established technical reproducibility of microarrays serves as a basis for meta-analysis, pathophysiological reproducibility across experiments is not well established. In this study, we carried out a large-scale analysis of disease-associated experiments obtained from NCBI GEO, and evaluated their concordance across a broad range of diseases and tissue types. On evaluating 429 experiments, representing 238 diseases and 122 tissues from 8435 microarrays, we find evidence for a general, pathophysiological concordance between experiments measuring the same disease condition. Furthermore, we find that the molecular signature of disease across tissues is overall more prominent than the signature of tissue expression across diseases. The results offer new insight into the quality of public microarray data using pathophysiological metrics, and support new directions in meta-analysis that include characterization of the commonalities of disease irrespective of tissue, as well as the creation of multi-tissue systems models of disease pathology using public data.

1 Bookmark
 · 
73 Views
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Transcription factor cross-repression is an important concept in cellular differentiation. A bistable toggle switch constitutes a molecular mechanism that determines cellular commitment and provides stability to transcriptional programs of binary cell fate choices. Experiments support that perturbations of these toggle switches can interconvert these binary cell fate choices, suggesting potential reprogramming strategies. However, more complex types of cellular transitions could involve perturbations of combinations of different types of multistable motifs. Here we introduce a method that generalizes the concept of transcription factor cross-repression to systematically predict sets of genes, whose perturbations induce cellular transitions between any given pair of cell types. Furthermore, to our knowledge, this is the first method that systematically makes these predictions without prior knowledge of potential candidate genes and pathways involved, providing guidance on systems where little is known. Given the increasing interest of cellular reprogramming in medicine and basic research, our method represents a useful computational methodology to assist researchers in the field in designing experimental strategies.
    Stem Cells 07/2013; · 7.70 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: We utilized abundant transcriptomic data for the primary classes of brain cancers to study the feasibility of separating all of these diseases simultaneously based on molecular data alone. These signatures were based on a new method reported herein - Identification of Structured Signatures and Classifiers (ISSAC) - that resulted in a brain cancer marker panel of 44 unique genes. Many of these genes have established relevance to the brain cancers examined herein, with others having known roles in cancer biology. Analyses on large-scale data from multiple sources must deal with significant challenges associated with heterogeneity between different published studies, for it was observed that the variation among individual studies often had a larger effect on the transcriptome than did phenotype differences, as is typical. For this reason, we restricted ourselves to studying only cases where we had at least two independent studies performed for each phenotype, and also reprocessed all the raw data from the studies using a unified pre-processing pipeline. We found that learning signatures across multiple datasets greatly enhanced reproducibility and accuracy in predictive performance on truly independent validation sets, even when keeping the size of the training set the same. This was most likely due to the meta-signature encompassing more of the heterogeneity across different sources and conditions, while amplifying signal from the repeated global characteristics of the phenotype. When molecular signatures of brain cancers were constructed from all currently available microarray data, 90% phenotype prediction accuracy, or the accuracy of identifying a particular brain cancer from the background of all phenotypes, was found. Looking forward, we discuss our approach in the context of the eventual development of organ-specific molecular signatures from peripheral fluids such as the blood.
    PLoS Computational Biology 07/2013; 9(7):e1003148. · 4.87 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The lack of specific symptoms at early tumor stages, together with a high biological aggressiveness of the tumor contribute to the high mortality rate for pancreatic cancer (PC), which has a five year survival rate of less than 5%. Improved screening for earlier diagnosis, through the detection of diagnostic and prognostic biomarkers provides the best hope of increasing the rate of curatively resectable carcinomas. Though many serum markers have been reported to be elevated in patients with PC, so far, most of these markers have not been implemented into clinical routine due to low sensitivity or specificity. In this study, we have identified genes that are significantly upregulated in PC, through a meta-analysis of large number of microarray datasets. We demonstrate that the biological functions ascribed to these genes are clearly associated with PC and metastasis, and that that these genes exhibit a strong link to pathways involved with inflammation and the immune response. This investigation has yielded new targets for cancer genes, and potential biomarkers for pancreatic cancer. The candidate list of cancer genes includes protein kinase genes, new members of gene families currently associated with PC, as well as genes not previously linked to PC. In this study, we are also able to move towards developing a signature for hypomethylated genes, which could be useful for early detection of PC. We also show that the significantly upregulated 800+ genes in our analysis can serve as an enriched pool for tissue and serum protein biomarkers in pancreatic cancer.
    PLoS ONE 01/2014; 9(4):e93046. · 3.53 Impact Factor

Full-text (2 Sources)

View
15 Downloads
Available from
May 16, 2014