An Integrative -omics Approach to Identify Functional Sub-Networks in Human Colorectal Cancer

University of Illinois at Urbana-Champaign, United States of America
PLoS Computational Biology (Impact Factor: 4.62). 01/2010; 6(1):e1000639. DOI: 10.1371/journal.pcbi.1000639
Source: PubMed


Emerging evidence indicates that gene products implicated in human cancers often cluster together in "hot spots" in protein-protein interaction (PPI) networks. Additionally, small sub-networks within PPI networks that demonstrate synergistic differential expression with respect to tumorigenic phenotypes were recently shown to be more accurate classifiers of disease progression when compared to single targets identified by traditional approaches. However, many of these studies rely exclusively on mRNA expression data, a useful but limited measure of cellular activity. Proteomic profiling experiments provide information at the post-translational level, yet they generally screen only a limited fraction of the proteome. Here, we demonstrate that integration of these complementary data sources with a "proteomics-first" approach can enhance the discovery of candidate sub-networks in cancer that are well-suited for mechanistic validation in disease. We propose that small changes in the mRNA expression of multiple genes in the neighborhood of a protein-hub can be synergistically associated with significant changes in the activity of that protein and its network neighbors. Further, we hypothesize that proteomic targets with significant fold change between phenotype and control may be used to "seed" a search for small PPI sub-networks that are functionally associated with these targets. To test this hypothesis, we select proteomic targets having significant expression changes in human colorectal cancer (CRC) from two independent 2-D gel-based screens. Then, we use random walk based models of network crosstalk and develop novel reference models to identify sub-networks that are statistically significant in terms of their functional association with these proteomic targets. Subsequently, using an information-theoretic measure, we evaluate synergistic changes in the activity of identified sub-networks based on genome-wide screens of mRNA expression in CRC. Cross-classification experiments to predict disease class show excellent performance using only a few sub-networks, underwriting the strength of the proposed approach in discovering relevant and reproducible sub-networks.

Download full-text

Full-text preview

Available from: PubMed Central
  • Source
    • "It is believed that dynamic alternations of complex interaction networks and molecular sub-networks can represent and influence responses of cells or organs to real-time changed microenvironment [10-12]. Thus, identification and validation of interaction networks and network biomarkers, especially at the protein level, become critical to develop disease-specific biomarkers for monitoring disease occurrence, progression or treatment efficacy [13-15]. The present review headlights network biomarkers, interaction networks, dynamical network biomarkers, with special focus on respiratory diseases, with an emphasis to integrate bioinformatics-based screening of biomarkers, network biomarker, dynamic network biomarkers with clinical informatics and phenotypes and establish a systems biomedicine-evidenced disease-specific dynamic network biomarkers "
    [Show abstract] [Hide abstract]
    ABSTRACT: Identification and validation of interaction networks and network biomarkers have become more critical and important in the development of disease-specific biomarkers, which are functionally changed during disease development, progression or treatment. The present review headlined the definition, significance, research and potential application for network biomarkers, interaction networks and dynamical network biomarkers (DNB). Disease-specific interaction networks, network biomarkers, or DNB have great significance in the understanding of molecular pathogenesis, risk assessment, disease classification and monitoring, or evaluations of therapeutic responses and toxicities. Protein-based DNB will provide more information to define the differences between the normal and pre-disease stages, which might point to early diagnosis for patients. Clinical bioinformatics should be a key approach to the identification and validation of disease-specific biomarkers.
    Clinical and Translational Medicine 06/2014; 3(1):16. DOI:10.1186/2001-1326-3-16
  • Source
    • "Protein network and mRNA profiles can be integrated to identify subnetwork biomarkers, that is, highly connected genes of a subnetwork whose sum of expression can be a marker of a disease state. There are several network-based approaches for identifying disease genes and protein interaction subnetworks which are disease signatures [86–88]. The application of a network analysis to metabolic PET (positron emission tomography) data obtained from patients with Parkinson's disease resulted in the identification and validation of two distinct spatial covariance patterns associated with the motor and cognitive manifestations of the disease [89]. "
    [Show abstract] [Hide abstract]
    ABSTRACT: one is the increasing capabilities of the computers and software tools from terabytes to petabytes and beyond, and the other is the advancement in high-throughput molecular biology producing piles of data related to genomes, transcriptomes, proteomes, metabolomes, interactomes, and so on. Biology has become a data intensive science and as a consequence biology and computer science have become complementary to each other bridged by other branches of science such as statistics, mathematics, physics, and chemistry. The combination of versatile knowledge has caused the advent of big-data biology, network biology, and other new branches of biology. Network biology for instance facilitates the system-level understanding of the cell or cellular components and subprocesses. It is often also referred to as systems biology. The purpose of this field is to understand organisms or cells as a whole at various levels of functions and mechanisms. Systems biology is now facing the challenges of analyzing big molecular biological data and huge biological networks. This review gives an overview of the progress in big-data biology, and data handling and also introduces some applications of networks and multivariate analysis in systems biology.
    BioMed Research International 05/2014; 2014(12):428570. DOI:10.1155/2014/428570 · 1.58 Impact Factor
  • Source
    • "We use a computational approach, based on a discrete-time random walk process, to track directional information flow in the interactome. Similar formulations have been previously used to prioritize candidate disease genes [82,83], discover network bio-markers for cancer [84], and identify protein complexes [85,86]. Additionally, there is a known correspondence between random-walk methods on undirected graphs and formulations based on circuit network models [87]. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Calorie restriction (CR) is one of the most conserved non-genetic interventions that extends healthspan in evolutionarily distant species, ranging from yeast to mammals. The target of rapamycin (TOR) has been shown to play a key role in mediating healthspan extension in response to CR by integrating different signals that monitor nutrient-availability and orchestrating various components of cellular machinery in response. Both genetic and pharmacological interventions that inhibit the TOR pathway exhibit a similar phenotype, which is not further amplified by CR. In this paper, we present the first comprehensive, computationally derived map of TOR downstream effectors, with the objective of discovering key lifespan mediators, their crosstalk, and high-level organization. We adopt a systematic approach for tracing information flow from the TOR complex and use it to identify relevant signaling elements. By constructing a high-level functional map of TOR downstream effectors, we show that our approach is not only capable of recapturing previously known pathways, but also suggests potential targets for future studies. Information flow scores provide an aggregate ranking of relevance of proteins with respect to the TOR signaling pathway. These rankings must be normalized for degree bias, appropriately interpreted, and mapped to associated roles in pathways. We propose a novel statistical framework for integrating information flow scores, the set of differentially expressed genes in response to rapamycin treatment, and the transcriptional regulatory network. We use this framework to identify the most relevant transcription factors in mediating the observed transcriptional response, and to construct the effective response network of the TOR pathway. This network is hypothesized to mediate life-span extension in response to TOR inhibition. Our approach, unlike experimental methods, is not limited to specific aspects of cellular response. Rather, it predicts transcriptional changes and post-translational modifications in response to TOR inhibition. The constructed effective response network greatly enhances understanding of the mechanisms underlying the aging process and helps in identifying new targets for further investigation of anti-aging regimes. It also allows us to identify potential network biomarkers for diagnosis and prognosis of age-related pathologies.
    BMC Systems Biology 08/2013; 7(1):84. DOI:10.1186/1752-0509-7-84 · 2.44 Impact Factor
Show more