A human functional protein interaction network and its application to cancer data analysis

Ontario Institute for Cancer Research, MaRS Centre, South Tower, 101 College Street, Suite 800, Toronto, ON M5G 0A3, Canada.
Genome biology (Impact Factor: 10.81). 05/2010; 11(5):R53. DOI: 10.1186/gb-2010-11-5-r53
Source: PubMed


One challenge facing biologists is to tease out useful information from massive data sets for further analysis. A pathway-based analysis may shed light by projecting candidate genes onto protein functional relationship networks. We are building such a pathway-based analysis system.
We have constructed a protein functional interaction network by extending curated pathways with non-curated sources of information, including protein-protein interactions, gene coexpression, protein domain interaction, Gene Ontology (GO) annotations and text-mined protein interactions, which cover close to 50% of the human proteome. By applying this network to two glioblastoma multiforme (GBM) data sets and projecting cancer candidate genes onto the network, we found that the majority of GBM candidate genes form a cluster and are closer than expected by chance, and the majority of GBM samples have sequence-altered genes in two network modules, one mainly comprising genes whose products are localized in the cytoplasm and plasma membrane, and another comprising gene products in the nucleus. Both modules are highly enriched in known oncogenes, tumor suppressors and genes involved in signal transduction. Similar network patterns were also found in breast, colorectal and pancreatic cancers.
We have built a highly reliable functional interaction network upon expert-curated pathways and applied this network to the analysis of two genome-wide GBM and several other cancer data sets. The network patterns revealed from our results suggest common mechanisms in the cancer biology. Our system should provide a foundation for a network or pathway-based analysis platform for cancer and other diseases.

1 Follower
12 Reads
  • Source
    • "Over twofold peptide absolute counts from LANA SIM than control samples with twice repeats were selected for analysis. Based on the proteomics band analysis and the whole genome protein–protein interaction network in humans [21] [22] [23], both core network (119 genes) and extended network (49 456 genes) were constructed for all gel slices analyzed. In the core network, two interaction partners are in the corresponding proteomics slice. "
    [Show abstract] [Hide abstract]
    ABSTRACT: The Kaposi's sarcoma-associated herpesvirus (KSHV)-encoded latent nuclear antigen LANA plays an essential role in viral episome maintenance. LANA also contributes to DNA replication and tumorigenesis during latency. Recent studies suggested that LANA was involved in regulation of SUMOylation which results in chromatin silencing. To examine the pleiotropic effects of LANA protein on host cell gene expression, we utilized MS analysis to identify cellular proteins associated with the SUMO-Interacting Motif of LANA (LANA(SIM) ). In addition to the 6 bands identified as substantially associated with LANA(SIM) , 151 proteins were positively identified by MS/MS analysis. Compared with previous proteomic analysis of the N- and C- truncated mutants of LANA (LANA(NC) ), our results revealed that a complex of specific proteins with relatively high SUMOylation and SIM motifs are associated with LANA(SIM) . Intriguingly, consistent with our previous report that identified KAP1 as a key component, the in-vitro SUMO-2 modified isoform has a substantially higher affinity with LANA(SIM) than the SUMO-1 modified isoform. Moreover, via cluster and pathway analysis, we proposed a hypothetical model for the LANA(SIM) regulatory circuit involving aberrant SUMOylation of cell cycle (particular mitotic), DNA unwinding and replication, and pre-mRNA/mRNA processing-related proteins. This study provides a SUMOylated and non-SUMOylated proteome profile of LANA(SIM) -associated complex, and facilitates our understanding that viral-mediated gene regulation through SUMOylation is important for KSHV persistence and pathogenesis. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
    Proteomics 04/2015; 15(12). DOI:10.1002/pmic.201400624 · 3.81 Impact Factor
  • Source
    • "ReactomeFIViz implements multiple features for users to perform network-based data analysis, including FI sub-network construction 3, network module discovery 3, functional annotation 3, HotNet mutation analysis 5, 6, and network module-based gene signature discovery from microarray data sets 7. The HotNet algorithm 5, 6 was implemented by porting python and MatLab code of HotNet _v1.0.0 (downloaded from to Java and R. For details about other algorithms and their implementations, please refer to our previous work 3, 7. "
    [Show abstract] [Hide abstract]
    ABSTRACT: High-throughput experiments are routinely performed in modern biological studies. However, extracting meaningful results from massive experimental data sets is a challenging task for biologists. Projecting data onto pathway and network contexts is a powerful way to unravel patterns embedded in seemingly scattered large data sets and assist knowledge discovery related to cancer and other complex diseases. We have developed a Cytoscape app called "ReactomeFIViz", which utilizes a highly reliable gene functional interaction network combined with human curated pathways derived from Reactome and other pathway databases. This app provides a suite of features to assist biologists in performing pathway- and network-based data analysis in a biologically intuitive and user-friendly way. Biologists can use this app to uncover network and pathway patterns related to their studies, search for gene signatures from gene expression data sets, reveal pathways significantly enriched by genes in a list, and integrate multiple genomic data types into a pathway context using probabilistic graphical models. We believe our app will give researchers substantial power to analyze intrinsically noisy high-throughput experimental data to find biologically relevant information.
    F1000 Research 09/2014; 3:146. DOI:10.12688/f1000research.4431.2
  • Source
    • "We believe that network inference approaches inspiring network medicine (Zanzoni et al., 2009; Barabasi et al., 2011), can be further specialized to allow characterization of cancer hallmarks through their many interfaces with clinical data, but also genetics, omics, and imaging modalities. How functional sub-networks can apply to cancer is illustrated in some recent work (Wu et al., 2010; Nibbe et al., 2010; Wen et al., 2013). Sub-networks are network partitions that can be obtained in various ways. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Cancer is a multifactorial and heterogeneous disease. The corresponding complexity appears at multiple levels: from the molecular and the cellular constitution to the macroscopic phenotype, and at the diagnostic and therapeutic management stages. The overall complexity can be approximated to a certain extent, e.g. characterized by a set of quantitative phenotypic observables recorded in time-space resolved dimensions by using multimodal imaging approaches. The transition from measures to data can be made effective through various computational inference methods, including networks, which are inherently capable of mapping variables and data to node- and/or edge-valued topological properties, dynamic modularity configurations, and functional motifs. We illustrate how networks can integrate imaging data to explain cancer complexity, and assess potential pre-clinical and clinical impact.
    Molecular Oncology 09/2014; 9(1). DOI:10.1016/j.molonc.2014.08.013 · 5.33 Impact Factor
Show more

Preview (2 Sources)

12 Reads
Available from