Principles and Strategies for Developing Network Models in Cancer

Department of Biological Sciences, Columbia University, 1212 Amsterdam Avenue, New York, NY 10027, USA.
Cell (Impact Factor: 32.24). 03/2011; 144(6):864-73. DOI: 10.1016/j.cell.2011.03.001
Source: PubMed


The flood of genome-wide data generated by high-throughput technologies currently provides biologists with an unprecedented opportunity: to manipulate, query, and reconstruct functional molecular networks of cells. Here, we outline three underlying principles and six strategies to infer network models from genomic data. Then, using cancer as an example, we describe experimental and computational approaches to infer "differential" networks that can identify genes and processes driving disease phenotypes. In conclusion, we discuss how a network-level understanding of cancer can be used to predict drug response and guide therapeutics.

Download full-text


Available from: Dana Pe'er, Feb 13, 2014
  • Source
    • "We also have to keep in mind that molecular networks exhibit dynamic responses to both internal states and external signals. Ultimately, health or disease states emerge from an individual's integration of these internal and external signals [47]. PPI networks are also dynamic. "
    [Show abstract] [Hide abstract]
    ABSTRACT: The challenging task of studying and modeling complex dynamics of biological systems in order to describe various human diseases has gathered great interest in recent years. Major biological processes are mediated through protein interactions, hence there is a need to understand the chaotic network that forms these processes in pursuance of understanding human diseases. The applications of protein interaction networks to disease datasets allow the identification of genes and proteins associated with diseases, the study of network properties, identification of subnetworks, and network-based disease gene classification. Although various protein interaction network analysis strategies have been employed, grand challenges are still existing. Global understanding of protein interaction networks via integration of high-throughput functional genomics data from different levels will allow researchers to examine the disease pathways and identify strategies to control them. As a result, it seems likely that more personalized, more accurate and more rapid disease gene diagnostic techniques will be devised in the future, as well as novel strategies that are more personalized. This mini-review summarizes the current practice of protein interaction networks in medical research as well as challenges to be overcome.
    Computational and Structural Biotechnology Journal 08/2014; 11(18):22-7. DOI:10.1016/j.csbj.2014.08.008
  • Source
    • "RNA microarrays have had a major impact on both experimental and computational biology. They have played a role in predicting molecular targets and bioactive compound modes-of-action [1-3], they have helped identify genes responsible for disease- and environmental-induced phenotypes [4-6]. At the same time, statistical methods for interpreting genome-wide microarray data have progressed over the past decade. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Genome-wide microarrays have been useful for predicting chemical-genetic interactions at the gene level. However, interpreting genome-wide microarray results can be overwhelming due to the vast output of gene expression data combined with off-target transcriptional responses many times induced by a drug treatment. This study demonstrates how experimental and computational methods can interact with each other, to arrive at more accurate predictions of drug-induced perturbations. We present a two-stage strategy that links microarray experimental testing and network training conditions to predict gene perturbations for a drug with a known mechanism of action in a well-studied organism. S. cerevisiae cells were treated with the antifungal, fluconazole, and expression profiling was conducted under different biological conditions using Affymetrix genome-wide microarrays. Transcripts were filtered with a formal network-based method, sparse simultaneous equation models and Lasso regression (SSEM-Lasso), under different network training conditions. Gene expression results were evaluated using both gene set and single gene target analyses, and the drug's transcriptional effects were narrowed first by pathway and then by individual genes. Variables included: (i) Testing conditions - exposure time and concentration and (ii) Network training conditions - training compendium modifications. Two analyses of SSEM-Lasso output - gene set and single gene - were conducted to gain a better understanding of how SSEM-Lasso predicts perturbation targets. This study demonstrates that genome-wide microarrays can be optimized using a two-stage strategy for a more in-depth understanding of how a cell manifests biological reactions to a drug treatment at the transcription level. Additionally, a more detailed understanding of how the statistical model, SSEM-Lasso, propagates perturbations through a network of gene regulatory interactions is achieved.
    BMC Systems Biology 01/2014; 8(1):7. DOI:10.1186/1752-0509-8-7 · 2.44 Impact Factor
  • Source
    • "Regulatory network modeling has been widely used for a systematic understanding of disease progression at the molecular level, particularly for cancer (comprehensively reviewed by Peer and Hacohen) [7]. Recently, Carro et al. applied a reverse engineering method for context-specific transcriptional regulatory networks to 176 gene expression profiles from high-grade glioblastoma (HGG) patients. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Gene expression signatures have been commonly used as diagnostic and prognostic markers for cancer subtyping. However, expression signatures frequently include many passengers, which are not directly related to cancer progression. Their upstream regulators such as transcription factors (TFs) may take a more critical role as drivers or master regulators to provide better clues on the underlying regulatory mechanisms and therapeutic applications. In order to identify prognostic master regulators, we took the known 85 prognostic signature genes for colorectal cancer and inferred their upstream TFs. To this end, a global transcriptional regulatory network was constructed with total >200,000 TF-target links using the ARACNE algorithm. We selected the top 10 TFs as candidate master regulators to show the highest coverage of the signature genes among the total 846 TF-target sub-networks or regulons. The selected TFs showed a comparable or slightly better prognostic performance than the original 85 signature genes in spite of greatly reduced number of marker genes from 85 to 10. Notably, these TFs were selected solely from inferred regulatory links using gene expression profiles and included many TFs regulating tumorigenic processes such as proliferation, metastasis, and differentiation. Our network approach leads to the identification of the upstream transcription factors for prognostic signature genes to provide leads to their regulatory mechanisms. We demonstrate that our approach could identify upstream biomarkers for a given set of signature genes with markedly smaller size and comparable performances. The utility of our method may be expandable to other types of signatures such as diagnosis and drug response.
    BMC Systems Biology 09/2013; 7(1):86. DOI:10.1186/1752-0509-7-86 · 2.44 Impact Factor
Show more