Integrating siRNA and protein-protein interaction data to identify an expanded insulin signaling network

Rosetta Inpharmatics, a wholly owned subsidiary of Merck & Co., Inc., Seattle, Washington 98109, USA.
Genome Research (Impact Factor: 14.63). 04/2009; 19(6):1057-67. DOI: 10.1101/gr.087890.108
Source: PubMed


Insulin resistance is one of the dominant symptoms of type 2 diabetes (T2D). Although the molecular mechanisms leading to this resistance are largely unknown, experimental data support that the insulin signaling pathway is impaired in patients who are insulin resistant. To identify novel components/modulators of the insulin signaling pathway, we designed siRNAs targeting over 300 genes and tested the effects of knocking down these genes in an insulin-dependent, anti-lipolysis assay in 3T3-L1 adipocytes. For 126 genes, significant changes in free fatty acid release were observed. However, due to off-target effects (in addition to other limitations), high-throughput RNAi-based screens in cell-based systems generate significant amounts of noise. Therefore, to obtain a more reliable set of genes from the siRNA hits in our screen, we developed and applied a novel network-based approach that elucidates the mechanisms of action for the true positive siRNA hits. Our analysis results in the identification of a core network underlying the insulin signaling pathway that is more significantly enriched for genes previously associated with insulin resistance than the set of genes annotated in the KEGG database as belonging to the insulin signaling pathway. We experimentally validated one of the predictions, S1pr2, as a novel candidate gene for T2D.
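The enrichment comparison described in the abstract (core network versus the KEGG insulin signaling gene set) can be illustrated with a one-sided hypergeometric test. The following is a minimal sketch, not the authors' actual procedure: the function name `hypergeom_enrichment_p` and every gene count below are hypothetical placeholders chosen for illustration, since the abstract does not report these numbers.

```python
from math import comb


def hypergeom_enrichment_p(population: int, annotated: int,
                           selected: int, overlap: int) -> float:
    """Return P(X >= overlap) for a hypergeometric draw.

    population: total number of genes in the background
    annotated:  background genes carrying the label of interest
                (e.g. previously associated with insulin resistance)
    selected:   size of the gene set being tested
    overlap:    genes in the tested set that carry the label
    """
    if not (0 <= annotated <= population and 0 <= selected <= population):
        raise ValueError("annotated and selected must lie in [0, population]")
    upper = min(annotated, selected)
    if overlap > upper:
        return 0.0
    # Smallest overlap that is combinatorially possible for this draw.
    lower = max(overlap, 0, selected - (population - annotated))
    total = comb(population, selected)
    tail = sum(
        comb(annotated, k) * comb(population - annotated, selected - k)
        for k in range(lower, upper + 1)
    )
    return tail / total


# Hypothetical counts, for illustration only (not taken from the paper).
BACKGROUND_GENES = 20000
INSULIN_RESISTANCE_GENES = 200

core_p = hypergeom_enrichment_p(BACKGROUND_GENES, INSULIN_RESISTANCE_GENES,
                                selected=50, overlap=10)
kegg_p = hypergeom_enrichment_p(BACKGROUND_GENES, INSULIN_RESISTANCE_GENES,
                                selected=130, overlap=8)

print(f"core network p-value: {core_p:.3g}")
print(f"KEGG gene set p-value: {kegg_p:.3g}")
```

With these made-up counts the smaller set has the smaller p-value, which is the sense in which one gene set can be "more significantly enriched" than another. Exact integer arithmetic via `math.comb` avoids a dependency on a statistics library, at the cost of speed for very large backgrounds.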



Available from: Jun Zhu, Aug 28, 2014
    • "Building on the hypothesis that neighboring genes within an interaction network share a common biological function, other network studies seed known disease genes in functional networks combining evidence from the literature, functional annotation, genomic distances, or genetic variation data (i.e., GWAS, SNP, eQTL), to search for nearby putative genes [13-15]. Related work integrates experimental data in the interaction network, for example, significant genes from regulatory or proteomic experiments, to discover candidate genes given their proximity to query genes [16-18]. "
    ABSTRACT: The etiology of cancer involves a complex series of genetic and environmental conditions. To better represent and study the intricate genetics of cancer onset and progression, we construct a network of biological interactions to search for groups of genes that compose cancer-related modules. Three cancer expression datasets are investigated to prioritize genes and interactions associated with cancer outcomes. Using a graph-based approach to search for communities of phenotype-related genes in microarray data, we find modules of genes associated with cancer phenotypes in a weighted interaction network. We implement Walktrap, a random-walk-based community detection algorithm, to identify biological modules predisposing to tumor growth in 22 hepatocellular carcinoma samples (GSE14520), adenoma development in 32 colorectal cancer samples (GSE8671), and prognosis in 198 breast cancer patients (GSE7390). For each study, we find the best scoring partitions under a maximum cluster size of 200 nodes. Significant modules highlight groups of genes that are functionally related to cancer and show promise as therapeutic targets; these include interactions among transcription factors (SPIB, RPS6KA2 and RPS6KA6), cell-cycle regulatory genes (BRSK1, WEE1 and CDC25C), a modulator of cell-(CBLC) and genes that regulate and participate in the map-kinase pathway (MAPK9, DUSP1, DUSP9, RIPK2). To assess the performance of Walktrap to find genomic modules (Walktrap-GM), we evaluate our results against other tools recently developed to discover disease modules in biological networks. Compared with other highly cited module-finding tools, jActiveModules and Matisse, Walktrap-GM shows strong performance in the discovery of modules enriched with known cancer genes. These results demonstrate that the Walktrap-GM algorithm identifies modules significantly enriched with cancer genes, their joint effects and promising candidate genes. The approach performs well when evaluated against similar tools and smaller overall module size allows for more specific functional annotation and facilitates the interpretation of these modules.
    Full-text · Article · Oct 2013 · BioData Mining
    • "Briefly, 5 µg of total RNA from an individual sample were used to synthesize dsDNA by RT. cRNA was produced by in vitro transcription and post-transcriptionally labeled with either Cy3 or Cy5. Reference and experimental cRNA samples were competitively hybridized to the Rosetta/Merck Mouse 25k v1.9 microarray which represents 22,700 genes (Tu et al., 2009). To minimize bias created by dye selection, for each comparison, two hybridizations were done with each cRNA sample pair using a fluorescent dye reversal strategy. "
    ABSTRACT: Estrogen Receptor α (ERα) and Estrogen Receptor β (ERβ) are steroid nuclear receptors that transduce estrogen signaling to control diverse physiological processes linked to reproduction, bone remodeling, behavior, immune response and endocrine-related diseases. In order to differentiate between ERα- and ERβ-mediated effects in vivo, ER subtype-selective biomarkers are essential. We utilized ERα knockout (AERKO) and ERβ knockout (BERKO) mouse liver RNA and genome-wide profiling to identify novel ERα-selective serum biomarker candidates. Results from the gene array experiments were validated using real-time RT-PCR and subsequent ELISAs to demonstrate changes in serum proteins. Here we present data that Lipopolysaccharide Binding Protein (LBP) is a novel liver-derived ERα-selective biomarker that can be measured in serum.
    Full-text · Article · Mar 2012 · Biomarkers
    • "(19–25)]. These methodologies were based on a variety of computational techniques including maximum likelihood (19), integer programming (21), Steiner trees (23,24), electric circuits (22,26) and Bayesian networks (2,27). Notably, many of these techniques were computationally intensive and therefore required the use of approximation schemes [e.g. "
    ABSTRACT: Cellular response to stimuli is typically complex and involves both regulatory and metabolic processes. Large-scale experimental efforts to identify components of these processes often comprise genetic screening and transcriptomic profiling assays. We previously established that, in yeast, genetic screens tend to identify response regulators, while transcriptomic profiling assays tend to identify components of metabolic processes. ResponseNet is a network-optimization approach that integrates the results from these assays with data on known molecular interactions. Specifically, ResponseNet identifies a high-probability sub-network, composed of signaling and regulatory molecular interaction paths, through which putative response regulators may lead to the measured transcriptomic changes. Computationally, this is achieved by formulating a minimum-cost flow optimization problem and solving it efficiently using linear programming tools. The ResponseNet web server offers a simple interface for applying ResponseNet. Users can upload weighted lists of proteins and genes and obtain a sparse, weighted, molecular interaction sub-network connecting their data. The predicted sub-network and its gene ontology enrichment analysis are presented graphically or as text. Consequently, the ResponseNet web server enables researchers who were previously limited to separate analyses of their distinct, large-scale experiments to meaningfully integrate their data and substantially expand their understanding of the underlying cellular response. ResponseNet is available at
    Full-text · Article · May 2011 · Nucleic Acids Research