Inferring metabolic States in uncharacterized environments using gene-expression measurements.

Department of Bioinformatics (CMBI), Centre for Molecular Life Sciences, Radboud University Nijmegen, The Netherlands
PLoS Computational Biology (Impact Factor: 4.83). 03/2013; 9(3):e1002988. DOI: 10.1371/journal.pcbi.1002988
Source: PubMed

ABSTRACT The large size of metabolic networks entails an overwhelming multiplicity in the possible steady-state flux distributions that are compatible with stoichiometric constraints. This space of possibilities is largest in the frequent situation where the nutrients available to the cells are unknown. These two factors: network size and lack of knowledge of nutrient availability, challenge the identification of the actual metabolic state of living cells among the myriad possibilities. Here we address this challenge by developing a method that integrates gene-expression measurements with genome-scale models of metabolism as a means of inferring metabolic states. Our method explores the space of alternative flux distributions that maximize the agreement between gene expression and metabolic fluxes, and thereby identifies reactions that are likely to be active in the culture from which the gene-expression measurements were taken. These active reactions are used to build environment-specific metabolic models and to predict actual metabolic states. We applied our method to model the metabolic states of Saccharomyces cerevisiae growing in rich media supplemented with either glucose or ethanol as the main energy source. The resulting models comprise about 50% of the reactions in the original model, and predict environment-specific essential genes with high sensitivity. By minimizing the sum of fluxes while forcing our predicted active reactions to carry flux, we predicted the metabolic states of these yeast cultures that are in large agreement with what is known about yeast physiology. Most notably, our method predicts the Crabtree effect in yeast cells growing in excess glucose, a long-known phenomenon that could not have been predicted by traditional constraint-based modeling approaches. Our method is of immediate practical relevance for medical and industrial applications, such as the identification of novel drug targets, and the development of biotechnological processes that use complex, largely uncharacterized media, such as biofuel production.

  • [Show abstract] [Hide abstract]
    ABSTRACT: Genome-scale metabolic network model (GEM) is a fundamental framework in systems metabolic engineering. GEM is built upon extensive experimental data and literature information on gene annotation and function, metabolites and enzymes so that it contains all known metabolic reactions within an organism. Constraint-based analysis of GEM enables the identification of phenotypic properties of an organism and hypothesis-driven engineering of cellular functions to achieve objectives. Along with the advances in omics, high-throughput technology and computational algorithms, the scope and applications of GEM have substantially expanded. In particular, various computational algorithms have been developed to predict beneficial gene deletion and amplification targets and used to guide the strain development process for the efficient production of industrially important chemicals. Furthermore, an Escherichia coli GEM was integrated with a pathway prediction algorithm and used to evaluate all possible routes for the production of a list of commodity chemicals in E. coli. Combined with the wealth of experimental data produced by high-throughput techniques, much effort has been exerted to add more biological contexts into GEM through the integration of omics data and regulatory network information for the mechanistic understanding and improved prediction capabilities. In this paper, we review the recent developments and applications of GEM focusing on the GEM-based computational algorithms available for microbial metabolic engineering.
    Journal of Industrial Microbiology and Biotechnology 12/2014; 42(3). · 2.51 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Prediction of possible flux distributions in a metabolic network provides detailed phenotypic information that links metabolism to cellular physiology. To estimate metabolic steady-state fluxes, the most common approach is to solve a set of macroscopic mass balance equations subjected to stoichiometric constraints while attempting to optimize an assumed optimal objective function. This assumption is justifiable in specific cases but may be invalid when tested across different conditions, cell populations, or other organisms. With an aim to providing a more consistent and reliable prediction of flux distributions over a wide range of conditions, in this article we propose a framework that uses the flux minimization principle to predict active metabolic pathways from mRNA expression data. The proposed algorithm minimizes a weighted sum of flux magnitudes, while biomass production can be bounded to fit an ample range from very low to very high values according to the analyzed context. We have formulated the flux weights as a function of the corresponding enzyme reaction's gene expression value, enabling the creation of context-specific fluxes based on a generic metabolic network. In case studies of wild-type Saccharomyces cerevisiae, and wild-type and mutant Escherichia coli strains, our method achieved high prediction accuracy, as gauged by correlation coefficients and sums of squared error, with respect to the experimentally measured values. In contrast to other approaches, our method was able to provide quantitative predictions for both model organisms under a variety of conditions. Our approach requires no prior knowledge or assumption of a context-specific metabolic functionality and does not require trial-and-error parameter adjustments. Thus, our framework is of general applicability for modeling the transcription-dependent metabolism of bacteria and yeasts.
    PLoS ONE 11/2014; 9(11):e112524. · 3.53 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Several computational methods have been developed that integrate transcriptomic data with genome-scale metabolic reconstructions to infer condition-specific system-wide intracellular metabolic flux distributions. In this mini-review, we describe each of these methods published to date with categorizing them based on four different grouping criteria (requirement for multiple gene expression datasets as input, requirement for a threshold to define a gene's high and low expression, requirement for a priori assumption of an appropriate objective function, and validation of predicted fluxes directly against measured intracellular fluxes). Then, we recommend which group of methods would be more suitable from a practical perspective.
    Computational and Structural Biotechnology Journal. 08/2014;


Available from
Jun 6, 2014