An Integrative -omics Approach to Identify Functional Sub-Networks in Human Colorectal Cancer

Center for Proteomics & Bioinformatics, Case Western Reserve University, Cleveland, Ohio, United States of America.
PLoS Computational Biology (Impact Factor: 4.83). 01/2010; 6(1):e1000639. DOI: 10.1371/journal.pcbi.1000639
Source: PubMed

ABSTRACT Emerging evidence indicates that gene products implicated in human cancers often cluster together in "hot spots" in protein-protein interaction (PPI) networks. Additionally, small sub-networks within PPI networks that demonstrate synergistic differential expression with respect to tumorigenic phenotypes were recently shown to be more accurate classifiers of disease progression when compared to single targets identified by traditional approaches. However, many of these studies rely exclusively on mRNA expression data, a useful but limited measure of cellular activity. Proteomic profiling experiments provide information at the post-translational level, yet they generally screen only a limited fraction of the proteome. Here, we demonstrate that integration of these complementary data sources with a "proteomics-first" approach can enhance the discovery of candidate sub-networks in cancer that are well-suited for mechanistic validation in disease. We propose that small changes in the mRNA expression of multiple genes in the neighborhood of a protein-hub can be synergistically associated with significant changes in the activity of that protein and its network neighbors. Further, we hypothesize that proteomic targets with significant fold change between phenotype and control may be used to "seed" a search for small PPI sub-networks that are functionally associated with these targets. To test this hypothesis, we select proteomic targets having significant expression changes in human colorectal cancer (CRC) from two independent 2-D gel-based screens. Then, we use random walk based models of network crosstalk and develop novel reference models to identify sub-networks that are statistically significant in terms of their functional association with these proteomic targets. Subsequently, using an information-theoretic measure, we evaluate synergistic changes in the activity of identified sub-networks based on genome-wide screens of mRNA expression in CRC. Cross-classification experiments to predict disease class show excellent performance using only a few sub-networks, underwriting the strength of the proposed approach in discovering relevant and reproducible sub-networks.

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Reverse engineering approaches to constructing gene regulatory networks (GRNs) based on genome-wide mRNA expression data have led to significant biological findings, such as the discovery of novel drug targets. However, the reliability of the reconstructed GRNs needs to be improved. Here, we propose an ensemble-based network aggregation approach to improving the accuracy of network topologies constructed from mRNA expression data. To evaluate the performances of different approaches, we created dozens of simulated networks from combinations of gene-set sizes and sample sizes and also tested our methods on three Escherichia coli datasets. We demonstrate that the ensemble-based network aggregation approach can be used to effectively integrate GRNs constructed from different studies - producing more accurate networks. We also apply this approach to building a network from epithelial mesenchymal transition (EMT) signature microarray data and identify hub genes that might be potential drug targets. The R code used to perform all of the analyses is available in an R package entitled "ENA", accessible on CRAN (
    PLoS ONE 11/2014; 9(11):e106319. DOI:10.1371/journal.pone.0106319 · 3.53 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Combination therapy, where several drugs interact with multiple targets, holds tremendous promise for effective clinical outcomes in the management of chronic, complex diseases such as cancer. In this paper, we take a step towards this grand goal by laying out the vision of a novel in silico, data-driven combination therapy framework called ingot for complex network diseases. Given the genomic and proteomic profiles of a patient population, it automatically predicts “optimal” set of synergistic drug combinations and corresponding dosages, which can potentially achieve the therapeutic goal while minimizing any off-target effects. Towards this goal, we present the architecture of ingot and discuss various non-traditional design challenges and innovative features. Specifically, in ingot, a disease-related probabilistic signaling network (psn) is constructed by integrating publicly-available disease-specific signaling networks with expression data. Next, topology and dynamics of the psn, which can be noisy and incomplete, are analyzed as a whole using probabilistic network analytics techniques to identify promising target combinations with desirable properties (e.g., synergistic in nature, good efficacy and minimum off-target effect) to regulate the activities of key disease-related molecular players. Finally, optimal candidate drug combinations to modulate these targets are predicted by integrating and analyzing drug information (e.g., DrugBank) with the target nodes. Successful realization of this framework can result in an effective platform for in silico screening of drug combinations in a rational way, by aiding early discovery of suitable combination therapy and guiding the design of further in vitro and in vivo experiments.
    2014 International Conference on Big Data and Smart Computing (BIGCOMP); 01/2014
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Several decades of molecular biology research have delivered a wealth of detailed descriptions of molecular interactions in normal and tumour cells. This knowledge has been functionally organised and assembled into dedicated biological pathway resources that serve as an invaluable tool, not only for structuring the information about molecular interactions but also for making it available for biological, clinical and computational studies. With the advent of high-throughput molecular profiling of tumours, close to complete molecular catalogues of mutations, gene expression and epigenetic modifications are available and require adequate interpretation. Taking into account the information about biological signalling machinery in cells may help to better interpret molecular profiles of tumours. Making sense out of these descriptions requires biological pathway resources for functional interpretation of the data. In this review, we describe the available biological pathway resources, their characteristics in terms of construction mode, focus, aims and paradigms of biological knowledge representation. We present a new resource that is focused on cancer-related signalling, the Atlas of Cancer Signalling Networks. We briefly discuss current approaches for data integration, visualisation and analysis, using biological networks, such as pathway scoring, guilt-by-association and network propagation. Finally, we illustrate with several examples the added value of data interpretation in the context of biological networks and demonstrate that it may help in analysis of high-throughput data like mutation, gene expression or small interfering RNA screening and can guide in patients stratification. Finally, we discuss perspectives for improving precision medicine using biological network resources and tools. Taking into account the information about biological signalling machinery in cells may help to better interpret molecular patterns of tumours and enable to put precision oncology into general clinical practice. © The Author 2015. Published by Oxford University Press on behalf of the UK Environmental Mutagen Society. All rights reserved. For permissions, please e-mail:
    Mutagenesis 01/2014; 30(2). DOI:10.1093/mutage/geu078 · 3.50 Impact Factor

Preview (2 Sources)

1 Download
Available from