Identification of mitochondrial disease genes through integrative analysis of multiple datasets.

European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg, Germany.
Methods (Impact Factor: 3.22). 11/2008; 46(4):248-55. DOI: 10.1016/j.ymeth.2008.10.002
Source: PubMed

ABSTRACT Determining the genetic factors in a disease is crucial to elucidating its molecular basis. This task is challenging due to a lack of information on gene function. The integration of large-scale functional genomics data has proven to be an effective strategy to prioritize candidate disease genes. Mitochondrial disorders are a prevalent and heterogeneous class of diseases that are particularly amenable to this approach. Here we explain the application of integrative approaches to the identification of mitochondrial disease genes. We first examine various datasets that can be used to evaluate the involvement of each gene in mitochondrial function. The data integration methodology is then described, accompanied by examples of common implementations. Finally, we discuss how gene networks are constructed using integrative techniques and applied to candidate gene prioritization. Relevant public data resources are indicated. This report highlights the success and potential of data integration as well as its applicability to the search for mitochondrial disease genes.

  • [Show abstract] [Hide abstract]
    ABSTRACT: This paper proposes Bayesian networks (BNs) that combine polarization corrected temperature (PCT) and scattering index (SI) methods to identify rainfall intensity. To learn BN network structures, meta-heuristic techniques including tabu search (TS), simulated annealing (SA) and genetic algorithm (GA) were empirically evaluated and compared for efficiency. The proposed models were applied to the Tanshui river basin in Taiwan. The meteorological data from the Special Sensor Microwave/Imager (SSM/I) of the National Oceanic and Atmospheric Administration (NOAA) comprises seven passive microwave brightness temperatures, and was used to detect rain rates. The data consisted of 71 typhoons affecting the watershed during 2000–2012. A preliminary analysis using simple meta-heuristic BNs identified the main attributes, namely the brightness temperatures of 19, 22, 37 and 85 GHz for rainfall retrieval. Based on the preliminary analysis of a simple BN run, the advanced BNs combined with SI and PCT successfully demonstrated improved rain rate retrieval accuracy. To compare the proposed meta-heuristic BNs, the traditional SI method, the SI-based support vector regression model (SI-SVR), and artificial neural network (ANN) were used as benchmarks. The results showed that (1) meta-heuristic BN techniques can be used to identify the vital attributes of the rainfall retrieval problem and their causal relationships and (2) according to a comparison of BNs combined with PCT and SI and artificial intelligence (AI)-based models (SI-SVR and ANN), in heavy, torrential, and pouring rainfall, models of BNs combined with PCT and SI provide a superior retrieval performance than that of AI-based models. Therefore, this study confirms that meta-heuristic BNs combined with PCT and SI is an efficient tool for addressing rainfall retrieval problems.
    Neurocomputing 07/2014; 136:71–81. DOI:10.1016/j.neucom.2014.01.030 · 2.01 Impact Factor
  • Source
    Annals of Laboratory Medicine 01/2014; 34(1):71-5. DOI:10.3343/alm.2014.34.1.71 · 1.48 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: High-throughput -omics data can be combined with large-scale molecular interaction networks, e.g., protein-protein interaction networks, to provide a unique framework for the investigation of human molecular biology. Interest in these integrative -omics methods is growing rapidly because of their potential to understand complexity and association with disease; such approaches have a focus on associations between phenotype and 'network-type'. The potential of this research is enticing yet there remain a series of important considerations. Here we discuss interaction data selection, data quality, the relative merits of using data from large high throughput studies versus a meta-database of smaller literature-curated studies, and possible issues of sociological or inspection bias in interaction data. Other work underway, especially international consortia to establish data formats, quality standards and address data redundancy, and the improvements these efforts are making to the field, is also evaluated. We present options for researchers intending to use large-scale molecular interaction networks as a functional context for protein or gene expression data, including microRNAs, especially in the context of human disease. This article is protected by copyright. All rights reserved.
    Proteomics 12/2013; 13(23-24). DOI:10.1002/pmic.201200570 · 3.97 Impact Factor

Full-text (2 Sources)

Available from
Jun 10, 2014