Integrated pathway-level analysis of transcriptomics and metabolomics data with IMPaLA

Department of Vertebrate Genomics, Max Planck Institute for Molecular Genetics, Berlin, Germany.
Bioinformatics (Impact Factor: 4.98). 09/2011; 27(20):2917-8. DOI: 10.1093/bioinformatics/btr499
Source: PubMed


Pathway-level analysis is a powerful approach enabling interpretation of post-genomic data at a higher level than that of individual biomolecules. Yet, it is currently hard to integrate more than one type of omics data in such an approach. Here, we present a web tool 'IMPaLA' for the joint pathway analysis of transcriptomics or proteomics and metabolomics data. It performs over-representation or enrichment analysis with user-specified lists of metabolites and genes using over 3000 pre-annotated pathways from 11 databases. As a result, pathways can be identified that may be disregulated on the transcriptional level, the metabolic level or both. Evidence of pathway disregulation is combined, allowing for the identification of additional pathways with changed activity that would not be highlighted when analysis is applied to any of the functional levels alone. The tool has been implemented both as an interactive website and as a web service to allow a programming interface.
The web interface of IMPaLA is available at A web services programming interface is provided at;;
Supplementary data are available at Bioinformatics online.

Download full-text


Available from: Atanas Kamburov
  • Source
    • "Putatively annotated metabolite names of significant DIMS peaks from the three databases were combined together, generating a list of significantly changed DIMS peaks for the polar metabolomics dataset (with one or more putative metabolite names). A list of annotated metabolite names was then submitted to Integrated Molecular Pathway-Level Analysis (IMPaLA) website (Kamburov et al., 2011) in compound list format before using Fisher's Exact test for pathway overrepresentation analysis. Putative annotation of significantly changed DIMS peaks from non-polar DIMS dataset was not subjected to pathway over-representation analysis due to less reliable annotation of lipid data. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Humans are routinely exposed to mixtures of flame retardants (FRs) from multiple sources including indoor dust. As a model to explore the potential effects of FR exposure from indoor dust on human health, the molecular responses of human hepatoma cells (HepG2/C3A cells) to a defined mixture of FRs and to a dust extract were investigated using multiple non-targeted omics approaches. A solvent extract of an indoor dust standard reference material SRM2585 was used as the surrogate dust sample, while a mixture of four FRs (TCEP, TCIPP, TDCIPP and HBCD) was used to mimic the FR mixture in the indoor dust. Cytotoxicity tests indicated there were no significant changes to cell viability or cell integrity after a 24- or 72-h exposure of HepG2/C3A cells to the FR mixture or to the dust extract. However, transcriptomics revealed changes in gene expression associated with the metabolism of xenobiotics (e.g. CYP1A1, CYP1A2, CYP2B6) in the dust extract group but not in the FR mixture group after a 72-h exposure. Few metabolic or lipidomic changes were detected in response to either the FR mixture or to the dust extract group. Given that the dust extract contained components that elicited a biological response, in contrast to the lack of response induced by the FR mixture, our findings suggest that the most likely causes of the molecular responses to indoor dust exposure lie in components other than the four FRs investigated, e.g. caused by PAHs or PCBs.
    Full-text · Article · Nov 2015 · Chemosphere
    • "For functional analysis of CsA-induced omics changes, pathway analyses were performed by using IMPaLA (Kamburov et al., 2011). "
    [Show abstract] [Hide abstract]
    ABSTRACT: In order to improve attrition rates of candidate-drugs there is a need for a better understanding of the mechanisms underlying drug-induced hepatotoxicity. We aim to further unravel the toxicological response of hepatocytes to a prototypical cholestatic compound by integrating transcriptomic and metabonomic profiling of HepG2 cells exposed to Cyclosporin A. Cyclosporin A exposure induced intracellular cholesterol accumulation and diminished intracellular bile acid levels. Performing pathway analyses of significant mRNAs and metabolites separately and integrated, resulted in more relevant pathways for the latter. Integrated analyses showed pathways involved in cell cycle and cellular metabolism to be significantly changed. Moreover, pathways involved in protein processing of the endoplasmic reticulum, bile acid biosynthesis and cholesterol metabolism were significantly affected. Our findings indicate that an integrated approach combining metabonomics and transcriptomics data derived from representative in vitro models, with bioinformatics can improve our understanding of the mechanisms of action underlying drug-induced hepatotoxicity. Furthermore, we showed that integrating multiple omics and thereby analyzing genes, microRNAs and metabolites of the opposed model for drug-induced cholestasis can give valuable information about mechanisms of drug-induced cholestasis in vitro and therefore could be used in toxicity screening of new drug candidates at an early stage of drug discovery. Copyright © 2015. Published by Elsevier Ltd.
    No preview · Article · Apr 2015 · Toxicology in Vitro
  • Source
    • "The availability of these massive data ideally allows for complex and detailed modeling of the underlying biological system but they also pose a serious potential multiple testing problem because the number of covariates are typically orders of magnitude larger than the number of observations. Recently, investigators are starting to combine several of these high-dimensional dataset which has increased the demand for analysis methods that accommodates these vast data (Nie et al., 2006; Brink-Jensen et al., 2013; Su et al., 2011; Kamburov et al., 2011). "
    [Show abstract] [Hide abstract]
    ABSTRACT: Penalized regression models such as the Lasso have proved useful for variable selection in many fields - especially for situations with high-dimensional data where the numbers of predictors far exceeds the number of observations. These methods identify and rank variables of importance but do not generally provide any inference of the selected variables. Thus, the variables selected might be the "most important" but need not be significant. We propose a significance test for the selection found by the Lasso. We introduce a procedure that computes inference and p-values for features chosen by the Lasso. This method rephrases the null hypothesis and uses a randomization approach which ensures that the error rate is controlled even for small samples. We demonstrate the ability of the algorithm to compute $p$-values of the expected magnitude with simulated data using a multitude of scenarios that involve various effects strengths and correlation between predictors. The algorithm is also applied to a prostate cancer dataset that has been analyzed in recent papers on the subject. The proposed method is found to provide a powerful way to make inference for feature selection even for small samples and when the number of predictors are several orders of magnitude larger than the number of observations. The algorithm is implemented in the MESS package in R and is freely available.
    Full-text · Article · Mar 2014
Show more