Automated Network Analysis Identifies Core Pathways in Glioblastoma

Memorial Sloan-Kettering Cancer Center, New York, New York, United States of America.
PLoS ONE (Impact Factor: 3.53). 02/2010; 5(2):e8918. DOI: 10.1371/journal.pone.0008918
Source: PubMed

ABSTRACT Glioblastoma multiforme (GBM) is the most common and aggressive type of brain tumor in humans and the first cancer with comprehensive genomic profiles mapped by The Cancer Genome Atlas (TCGA) project. A central challenge in large-scale genome projects, such as the TCGA GBM project, is the ability to distinguish cancer-causing "driver" mutations from passively selected "passenger" mutations.
In contrast to a purely frequency based approach to identifying driver mutations in cancer, we propose an automated network-based approach for identifying candidate oncogenic processes and driver genes. The approach is based on the hypothesis that cellular networks contain functional modules, and that tumors target specific modules critical to their growth. Key elements in the approach include combined analysis of sequence mutations and DNA copy number alterations; use of a unified molecular interaction network consisting of both protein-protein interactions and signaling pathways; and identification and statistical assessment of network modules, i.e. cohesive groups of genes of interest with a higher density of interactions within groups than between groups.
We confirm and extend the observation that GBM alterations tend to occur within specific functional modules, in spite of considerable patient-to-patient variation, and that two of the largest modules involve signaling via p53, Rb, PI3K and receptor protein kinases. We also identify new candidate drivers in GBM, including AGAP2/CENTG1, a putative oncogene and an activator of the PI3K pathway; and, three additional significantly altered modules, including one involved in microtubule organization. To facilitate the application of our network-based approach to additional cancer types, we make the method freely available as part of a software tool called NetBox.

Download full-text


Available from: Nikolaus Schultz, Jul 27, 2015
1 Follower
  • Source
    • "Shell command line, regular expression, Cytoscape (Shannon et al., 2003), Netbox (Cerami et al., 2010), Inkscape, Reactome (Joshi-Tope et al., 2005; Matthews et al., 2009) were used and are described on detail in supplemental tutorial or in this paper (Mezhoud, 2013). "
    [Show abstract] [Hide abstract]
    ABSTRACT: Abstract: Radiology is an essential technology in medicine and is used for organ diagnosis, radio-tracing and radiotherapy. The risks for the radiobiology workers are not assess sufficiently because measuring instruments fail to detect very low doses. This manuscript presents an investigation on the potential risks for radiobiology workers, due to the occupational exposure to low doses of irradiation. In this respect we used plasma samples from 16 subjects which were suppose to receive very low dose of irradiation in a longer time period. We used chromosomal aberrations, means of oxidative stress measurement and combined proteomics and bioinformatics in order to elucidate risks of such exposure. We found significant chromosome aberrations in lymphocytes and the increase of oxidative stress biomarkers in plasma. In addition the proteomic analysis shows differentially regulated proteins which three of them were verified by ELISA tests. This proteomic analysis picks out some interesting proteins that may belong to biomarkers panel of radiation exposure.
    International Journal of Low Radiation 03/2014; DOI:10.1504/IJLR.2014.060911
  • Source
    • "A fundamental challenge in translating cancer genomic findings into clinical application lies in the ability to find " driver " genetic and genomic alterations that contribute to tumor initiation, progression, and metastasis (Chin and Gray, 2008; Simon, 2010). As integrated genomic studies have emerged, it has become increasingly clear that true oncogenic mechanisms are more visible when combining evidence across patterns of alterations in DNA copy number, methylation, gene expression and mutational profiles (TCGA, 2008; Chitale et al., 2009; Cerami et al., 2010; Vaske et al., 2010). "
    [Show abstract] [Hide abstract]
    ABSTRACT: High resolution microarrays and second-generation sequenc-ing platforms are powerful tools to investigate genome-wide alterations in DNA copy number, methylation, and gene expression associated with a dis-ease. An integrated genomic profiling approach measuring these omics data types simultaneously in the same set of biological samples would render an integrated data resolution that would not be available with any single data type. In the context of integrative genomics, we propose a statistical frame-work that aims to address clustering, data integration, dimension reduction, and variable selection in a unified and computationally efficient way. We de-rive sparse integrative clustering solutions by incorporating regularization methods including the Lasso, Elasticnet, and Fused Lasso. The proposed method is applied to integrate genomic, epigenomic, and transcriptomic data for subtype analysis in breast cancer.
    The Annals of Applied Statistics 03/2013; 7(1). DOI:10.1214/12-AOAS578 · 1.69 Impact Factor
  • Source
    • "Network modules with sequence mutations were enriched in known oncogenes, tumor suppressors and signal transduction genes. Similar PPIN patterns were found in breast, colorectal and pancreatic cancers (Pujol et al., 2009; Wu & Stein, 2010; Cerami et al., 2010). "
    [Show abstract] [Hide abstract]
    ABSTRACT: Biological networks are now the starting point of many studies for understanding and curing human diseases. A protein–protein interaction (PPI) involves two or more proteins binding together, often to carry out their biological function. Many of the most important molecular processes in the cell, such as DNA replication, are carried out by molecular machines that are built from a large number of pro-tein components organised by their PPIs. A set of PPIs occurring at the same time and location of the cell is known as a complex of proteins. A protein-protein interaction network (PPIN) is a collection of PPIs, often deposited in online databases. PPINs may complement other datasets, such as protein structural information, which may lead to understanding the different subparts that contribute to the function of a whole biological system (Bapat et al., 2010). Known molecular pathways can be embed -ded in PPINs to derive new knowledge, such as on spatio-temporal dynamics (Schmid & McMahon, 2007; Srinivasan et al., 2007). Understanding PPINs allows one to target them with drugs in order to block harmful interactions and potentially cure diseases (Pujol et al., 2009). PPINs are also playing a role in the emerging field of synthetic biology that promises to create biological systems with new properties, and modules of networks are available ( (Russell & Aloy, 2008; Aloy, 2007). A major issue in using PPINs in practice involves dealing with errors in the form of missing in -teractions and false signals. 1 Experimental methods for extracting PPINs and applications The yeast two-hybrid (Y2H) system and affinity purification followed by mass spectrometry (AP-MS) are two commonly used experimental methods for detecting interacting proteins. The Y2H system identifies direct interactions between pairs of proteins that physically interact with each other, represented as binary relationships. Large-scale experiments involve making yeast colony arrays; each yeast colony expresses a defined pair of bait and prey proteins that are scored for activity in an automated manner, where high activity indicates interaction. AP-MS detects presence of a protein in a
    Protein Purification and Analysis, 01/2013: chapter Protein-protein interaction networks.; iConcept Press., ISBN: ISBN: 978-1477555-05-7
Show more