MGV: A Generic Graph Viewer for Comparative Omics Data

Center for Bioinformatics Tübingen, Faculty of Science, University of Tübingen, 72076 Tübingen, Germany.
Bioinformatics (Impact Factor: 4.98). 06/2011; 27(16):2248-55. DOI: 10.1093/bioinformatics/btr351
Source: PubMed


High-throughput transcriptomics, proteomics and metabolomics methods have revolutionized our knowledge of biological systems. To gain knowledge from comparative omics studies, strong data integration and visualization features are required. Knowledge gained from these studies is often available in the form of graphs, and their visualization is especially useful in a wide range of systems biology topics, including pathway analysis, interaction networks or gene models. Especially, it is necessary to compare biological models with measured data. This allows the identification of new models and new insights into existing ones.
We present MGV, a versatile generic graph viewer for multiomics data. MGV is integrated into Mayday (Battke et al., 2010). It extends Mayday's visual analytics capabilities by integrating a wide range of biological models, high-throughput data and meta information to display enriched graphs that combine data and models. A wide range of tools is available for visualization of nodes, data-aware graph layout as well as automatic and manual aggregation and refinement of the data. We show the usefulness of MGV applied to several problems, including differential expression of alternative transcripts, transcription factor interaction, cross-study clustering comparison and integration of transcriptomics and metabolomics data for pathway analysis.
MGV is a open-source software implemented in Java and freely available as a part of Mayday at
Supplementary data are available at Bioinformatics online.

Download full-text


Available from: Kay Nieselt
  • Source
    • "Horizontal integration and meta-analysis of microarray data sets has been a field of intensive research in the past decade (3–17). Vertical integration of omics data has recently become a major focus in bioinformatics, as multi-level omics data sets (including single-nucleotide polymorphisms, gene, protein and metabolite expression data) are increasingly being collected from large clinical cohort studies (18–23). "
    [Show abstract] [Hide abstract]
    ABSTRACT: The widespread applications of various 'omics' technologies in biomedical research together with the emergence of public data repositories have resulted in a plethora of data sets for almost any given physiological state or disease condition. Properly combining or integrating these data sets with similar basic hypotheses can help reduce study bias, increase statistical power and improve overall biological understanding. However, the difficulties in data management and the complexities of analytical approaches have significantly limited data integration to enable meta-analysis. Here, we introduce integrative meta-analysis of expression data (INMEX), a user-friendly web-based tool designed to support meta-analysis of multiple gene-expression data sets, as well as to enable integration of data sets from gene expression and metabolomics experiments. INMEX contains three functional modules. The data preparation module supports flexible data processing, annotation and visualization of individual data sets. The statistical analysis module allows researchers to combine multiple data sets based on P-values, effect sizes, rank orders and other features. The significant genes can be examined in functional analysis module for enriched Gene Ontology terms or Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, or expression profile visualization. INMEX has built-in support for common gene/metabolite identifiers (IDs), as well as 45 popular microarray platforms for human, mouse and rat. Complex operations are performed through a user-friendly web interface in a step-by-step manner. INMEX is freely available at
    Full-text · Article · Jun 2013 · Nucleic Acids Research
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: We explore the utility of p-value weighting for enhancing the power to detect differential metabolites in a two-sample setting. Related gene expression information is used to assign an a priori importance level to each metabolite being tested. We map the gene expression to a metabolite through pathways and then gene expression information is summarized per-pathway using gene set enrichment tests. Through simulation we explore four styles of enrichment tests and four weight functions to convert the gene information into a meaningful p-value weight. We implement the p-value weighting on a prostate cancer metabolomic dataset. Gene expression on matched samples is used to construct the weights. Under certain regulatory conditions, the use of weighted p-values does not inflate the type I error above what we see for the un-weighted tests except in high correlation situations. The power to detect differential metabolites is notably increased in situations with disjoint pathways and shows moderate improvement, relative to the proportion of enriched pathways, when pathway membership overlaps.
    Full-text · Article · Apr 2012 · Genomics
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Motivation: Traditionally, microarrays were almost exclusively used for the genome-wide analysis of differential gene expression. But nowadays, their scope of application has been extended to various genomic features, such as microRNAs (miRNAs), proteins and DNA methylation (DNAm). Most available methods for the visualization of these datasets are focused on individual platforms and are not capable of integratively visualizing multiple microarray datasets from cross-platform studies. Above all, there is a demand for methods that can visualize genomic features that are not directly linked to protein-coding genes, such as regulatory RNAs (e.g. miRNAs) and epigenetic alterations (e.g. DNAm), in a pathway-centred manner. Results: We present a novel pathway-based visualization method that is especially suitable for the visualization of high-throughput datasets from multiple different microarray platforms that were used for the analysis of diverse genomic features in the same set of biological samples. The proposed methodology includes concepts for linking DNAm and miRNA expression datasets to canonical signalling and metabolic pathways. We further point out strategies for displaying data from multiple proteins and protein modifications corresponding to the same gene. Ultimately, we show how data from four distinct platform types (messenger RNA, miRNA, protein and DNAm arrays) can be integratively visualized in the context of canonical pathways. Availability: The described method is implemented as part of the InCroMAP application that is freely available at Contact: or
    Preview · Article · Oct 2012 · Bioinformatics
Show more