Integrated pathway-level analysis of transcriptomics and metabolomics data with IMPaLA

Department of Vertebrate Genomics, Max Planck Institute for Molecular Genetics, Berlin, Germany.
Bioinformatics (Impact Factor: 4.98). 09/2011; 27(20):2917-8. DOI: 10.1093/bioinformatics/btr499
Source: PubMed


Pathway-level analysis is a powerful approach enabling interpretation of post-genomic data at a higher level than that of individual biomolecules. Yet, it is currently hard to integrate more than one type of omics data in such an approach. Here, we present a web tool 'IMPaLA' for the joint pathway analysis of transcriptomics or proteomics and metabolomics data. It performs over-representation or enrichment analysis with user-specified lists of metabolites and genes using over 3000 pre-annotated pathways from 11 databases. As a result, pathways can be identified that may be disregulated on the transcriptional level, the metabolic level or both. Evidence of pathway disregulation is combined, allowing for the identification of additional pathways with changed activity that would not be highlighted when analysis is applied to any of the functional levels alone. The tool has been implemented both as an interactive website and as a web service to allow a programming interface.
The web interface of IMPaLA is available at A web services programming interface is provided at;;
Supplementary data are available at Bioinformatics online.

Download full-text


Available from: Atanas Kamburov,
    • "For functional analysis of CsA-induced omics changes, pathway analyses were performed by using IMPaLA (Kamburov et al., 2011). "
    [Show abstract] [Hide abstract]
    ABSTRACT: In order to improve attrition rates of candidate-drugs there is a need for a better understanding of the mechanisms underlying drug-induced hepatotoxicity. We aim to further unravel the toxicological response of hepatocytes to a prototypical cholestatic compound by integrating transcriptomic and metabonomic profiling of HepG2 cells exposed to Cyclosporin A. Cyclosporin A exposure induced intracellular cholesterol accumulation and diminished intracellular bile acid levels. Performing pathway analyses of significant mRNAs and metabolites separately and integrated, resulted in more relevant pathways for the latter. Integrated analyses showed pathways involved in cell cycle and cellular metabolism to be significantly changed. Moreover, pathways involved in protein processing of the endoplasmic reticulum, bile acid biosynthesis and cholesterol metabolism were significantly affected. Our findings indicate that an integrated approach combining metabonomics and transcriptomics data derived from representative in vitro models, with bioinformatics can improve our understanding of the mechanisms of action underlying drug-induced hepatotoxicity. Furthermore, we showed that integrating multiple omics and thereby analyzing genes, microRNAs and metabolites of the opposed model for drug-induced cholestasis can give valuable information about mechanisms of drug-induced cholestasis in vitro and therefore could be used in toxicity screening of new drug candidates at an early stage of drug discovery. Copyright © 2015. Published by Elsevier Ltd.
    Toxicology in Vitro 04/2015; 29(3). DOI:10.1016/j.tiv.2014.12.016 · 2.90 Impact Factor
    • "Towards this goal, the web-based Integrated Molecular Pathway-Level Analysis (IMPaLA; http:// (Kamburov et al., 2011)) was used. IMPaLA combines the analysis using a comprehensive basis of biochemical pathways currently taken from 11 publicly available resources. "
    [Show abstract] [Hide abstract]
    ABSTRACT: The objective of this study was to contribute to the first comprehensive metabolomic characterization of the human sperm cell through the application of two untargeted platforms based on proton nuclear magnetic resonance ((1) H-NMR) spectroscopy and gas chromatography coupled to mass spectrometry (GC-MS). Using these two complementary strategies, we were able to identify a total of 69 metabolites, of which 42 were identified using NMR, 27 using GC-MS and 4 by both techniques. The identity of some of these metabolites was further confirmed by two-dimensional (1) H-(1) H homonuclear correlation spectroscopy (COSY) and (1) H-(13) C heteronuclear single-quantum correlation (HSQC) spectroscopy. Most of the metabolites identified are reported here for the first time in mature human spermatozoa. The relationship between the metabolites identified and the previously reported sperm proteome was also explored. Interestingly, overrepresented pathways included not only the metabolism of carbohydrates, but also of lipids and lipoproteins. Of note, a large number of the metabolites identified belonged to the amino acids, peptides and analogues super class. The identification of this initial set of metabolites represents an important first step to further study their function in male gamete physiology and to explore potential reasons for dysfunction in future studies. We also demonstrate that the application of NMR and MS provides complementary results, thus constituting a promising strategy towards the completion of the human sperm cell metabolome. © 2015 American Society of Andrology and European Academy of Andrology.
    Andrology 04/2015; 3(3). DOI:10.1111/andr.12027 · 2.30 Impact Factor
  • Source
    • "The availability of these massive data ideally allows for complex and detailed modeling of the underlying biological system but they also pose a serious potential multiple testing problem because the number of covariates are typically orders of magnitude larger than the number of observations. Recently, investigators are starting to combine several of these high-dimensional dataset which has increased the demand for analysis methods that accommodates these vast data (Nie et al., 2006; Brink-Jensen et al., 2013; Su et al., 2011; Kamburov et al., 2011). "
    [Show abstract] [Hide abstract]
    ABSTRACT: Penalized regression models such as the Lasso have proved useful for variable selection in many fields - especially for situations with high-dimensional data where the numbers of predictors far exceeds the number of observations. These methods identify and rank variables of importance but do not generally provide any inference of the selected variables. Thus, the variables selected might be the "most important" but need not be significant. We propose a significance test for the selection found by the Lasso. We introduce a procedure that computes inference and p-values for features chosen by the Lasso. This method rephrases the null hypothesis and uses a randomization approach which ensures that the error rate is controlled even for small samples. We demonstrate the ability of the algorithm to compute $p$-values of the expected magnitude with simulated data using a multitude of scenarios that involve various effects strengths and correlation between predictors. The algorithm is also applied to a prostate cancer dataset that has been analyzed in recent papers on the subject. The proposed method is found to provide a powerful way to make inference for feature selection even for small samples and when the number of predictors are several orders of magnitude larger than the number of observations. The algorithm is implemented in the MESS package in R and is freely available.
Show more