Discover Regulatory DNA Elements Using Chromatin Signatures and Artificial Neural Network

Department of Internal Medicine, University of Iowa, 2294 CBRB, 285 Newton Road, Iowa City, IA 52242, USA.
Bioinformatics (Impact Factor: 4.98). 07/2010; 26(13):1579-86. DOI: 10.1093/bioinformatics/btq248
Source: PubMed


Recent large-scale chromatin states mapping efforts have revealed characteristic chromatin modification signatures for various types of functional DNA elements. Given the important influence of chromatin states on gene regulation and the rapid accumulation of genome-wide chromatin modification data, there is a pressing need for computational methods to analyze these data in order to identify functional DNA elements. However, existing computational tools do not exploit data transformation and feature extraction as a means to achieve a more accurate prediction.
We introduce a new computational framework for identifying functional DNA elements using chromatin signatures. The framework consists of a data transformation and a feature extraction step followed by a classification step using time-delay neural network. We implemented our framework in a software tool CSI-ANN (chromatin signature identification by artificial neural network). When applied to predict transcriptional enhancers in the ENCODE region, CSI-ANN achieved a 65.5% sensitivity and 66.3% positive predictive value, a 5.9% and 11.6% improvement, respectively, over the previously best approach.
CSI-ANN is implemented in Matlab. The source code is freely available at
Supplementary Materials are available at Bioinformatics online.

Full-text preview

Available from:
  • Source
    • "Meanwhile, indirect methods use correlational analysis of enhancer regions with some landmark DNA features for the inference of approximate locations—e.g., CpG island, chromatin or histone marks. Extensive studies on indirect methods are focused mainly in the generation and modelling of discriminative features from landmarks of supervised learning [2] [3]. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Using Genetic Algorithm, this paper presents a modelling method to generate novel logical-based features from DNA sequences enriched with H3K4mel histone signatures. Current histone signature is mostly represented using k-mers content features incapable of representing all the possible complex interactions of various DNA segments. The main contributions are, among others: (a) demonstrating that there are complex interactions among sequence segments in the histone regions; (b) developing a parse tree representation of the logical complex features. The proposed novel feature is compared to the k-mers content features using datasets from the mouse (mm9) genome. Evaluation results show that the new feature improves the prediction performance as shown by f-measure for all datasets tested. Also, it is discovered that tree-based features generated from a single chromosome can be generalized to predict histone marks in other chromosomes not used in the training. These findings have a great impact on feature design considerations for histone signatures as well as other classifier design features.
    Full-text · Article · Sep 2014 · Bio-medical materials and engineering
  • Source
    • "Recent advances in high-throughput technologies such as ChIP-seq have led to the discoveries that various regulatory sequences are characterized by distinct patterns of histone modifications, which have increasingly been used as biochemical signatures for annotation of the genome (Rivera and Ren 2013). For instance, combinations of H3K4me1 and H3K4me3 (Heintzman et al. 2007) have been exploited for the identification of enhancers and promoters in mammalian genomes (Won et al. 2008; Firpi et al. 2010; Fernandez and Miranda-Saavedra 2012; Rajagopal et al. 2013). Similarly, combination patterns of H3K4me3 and H3K36me3 were used to uncover a large number of long intergenic noncoding (linc) genes (Guttman et al. 2009 ). "
    [Show abstract] [Hide abstract]
    ABSTRACT: In eukaryotic cells, histone lysines are frequently acetylated. However, unlike modifications such as methylation, histone acetylation is often considered redundant. As such, the functional roles of distinct histone acetylations are largely unexplored. We previously developed an algorithm RFECS to discover the most informative modifications associated with the classification or prediction of genome-wide enhancers. Here, we use this tool to identify the modifications most predictive of promoters, enhancers, and gene bodies. Surprisingly, we find that histone acetylation alone performs well in distinguishing these unique genomic regions. Further, we find the association of characteristic acetylation patterns with genic regions and provide novel insights into the association of chromatin state with splicing. Taken together, our work underscores the diverse functional roles of histone acetylation in gene regulation, and provides several testable hypotheses to dissect these roles.
    Full-text · Article · Aug 2014 · G3-Genes Genomes Genetics
  • Source
    • "A marker radioisotope attaches to a specific radioligand, which we contains a chemical affinity for the specific tissue of interest. This marriage allows the combination of ligand and radioisotope to be carried and attached to the specific area of interest in the target organ [13]. Subsequently, due to the gamma-emission of the isotope [8], the ligand concentration is visualized by a gamma-camera. "
    [Show abstract] [Hide abstract]
    ABSTRACT: The recent introduction of high-resolution molecular imaging technology is considered by many experts as a major breakthrough that will potentially lead to a revolutionary paradigm shift in health care and revolutionize clinical practice. This paper explores the challenges and strengths of the current major imaging modalities, as well as the biophysics engineering their repertoire of capabilities. Advancements in the mechanical aspects of both PET and SPECT imaging will advance molecular imaging diagnostic capabilities and have a direct impact on clinical medicine and biomedical research practice. A better understanding of the strengths and limitations of functional imaging modalities in the context of their particular hardware and software mechanics will shed light onto how we can advance their diagnostic capabilities on a biological level. Herein, this paper demonstrates the fundamental biomechanical differences between PET and SPECT imaging, and how these fundamental differences translate into clinically relevant data acquisition for brain disorders.
    Full-text · Article · Jul 2013
Show more