Widespread RNA and DNA Sequence Differences in the Human Transcriptome

Department of Biostatistics, University of Pennsylvania School of Medicine, Philadelphia, PA 19104, USA.
Science (Impact Factor: 31.48). 05/2011; 333(6038):53-8. DOI: 10.1126/science.1207018
Source: PubMed

ABSTRACT The transmission of information from DNA to RNA is a critical process. We compared RNA sequences from human B cells of 27 individuals to the corresponding DNA sequences from the same individuals and uncovered more than 10,000 exonic sites where the RNA sequences do not match that of the DNA. All 12 possible categories of discordances were observed. These differences were nonrandom as many sites were found in multiple individuals and in different cell types, including primary skin cells and brain tissues. Using mass spectrometry, we detected peptides that are translated from the discordant RNA sequences and thus do not correspond exactly to the DNA sequences. These widespread RNA-DNA differences in the human transcriptome provide a yet unexplored aspect of genome variation.

1 Bookmark
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Advances in sequencing technology have allowed for detailed analyses of the transcriptome at single-nucleotide resolution, facilitating the study of RNA editing or sequence differences between RNA and DNA genome-wide. In humans, two types of post-transcriptional RNA editing processes are known to occur: A-to-I deamination by ADAR and C-to-U deamination by APOBEC1. In addition to these sequence differences, researchers have reported the existence of all 12 types of RNA-DNA sequence differences (RDDs); however, the validity of these claims is debated, as many studies claim that technical artifacts account for the majority of these non-canonical sequence differences. In this study, we used a detection theory approach to evaluate the performance of RNA-Sequencing (RNA-Seq) and associated aligners in accurately identifying RNA-DNA sequence differences. By generating simulated RNA-Seq datasets containing RDDs, we assessed the effect of alignment artifacts and sequencing error on the sensitivity and false discovery rate of RDD detection. Overall, we found that even in the presence of sequencing errors, false negative and false discovery rates of RDD detection can be contained below 10% with relatively lenient thresholds. We also assessed the ability of various filters to target false positive RDDs and found them to be effective in discriminating between true and false positives. Lastly, we used the optimal thresholds we identified from our simulated analyses to identify RDDs in a human lymphoblastoid cell line. We found approximately 6,000 RDDs, the majority of which are A-to-G edits and likely to be mediated by ADAR. Moreover, we found the majority of non A-to-G RDDs to be associated with poorer alignments and conclude from these results that the evidence for widespread non-canonical RDDs in humans is weak. Overall, we found RNA-Seq to be a powerful technique for surveying RDDs genome-wide when coupled with the appropriate thresholds and filters.
    PLoS ONE 11/2014; 9(11):e112040. DOI:10.1371/journal.pone.0112040 · 3.53 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Adenosine-to-inosine (A-to-I) RNA editing is site-specific modification of RNAs that increases the diversity of the transcriptome and proteome. Most insects undergo complete metamorphosis, including four life cycle stages: egg, larva, pupa and adult. Many previous studies have confirmed that RNA-editing events occur in a development-specific manner; in other words, RNA-editing levels change during metamorphosis. Here, we describe an effort to identify the developmental specificity of RNA-editing events using a large-scale computational analysis of RNA-seq data derived from four developmental stages of the diamondback moth, Plutella xylostella. One thousand one hundred and eighty-seven A-to-I RNA-editing sites were predicted to be developmental stage specific (false-discovery rate <0.01) and 1,094 of these sites were located in protein-coding regions. Editing of 152 sites resulted in an altered amino acid residue. A putative adult-specific A-to-I RNA-editing site was verified by comparing cDNA sequences with its corresponding genomic locus at different stages of the P. xylostella life cycle. Our findings will help elucidate the role of A-to-I RNA editing in the regulation of metamorphosis. Further studies detailing changes in the extent of editing are needed to establish how as yet unknown regulatory factors are involved in the editing mechanism and what biological functions' editing serves.
    Molecular Genetics and Genomics 12/2014; DOI:10.1007/s00438-014-0968-4 · 2.83 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Cells control dynamic transitions in transcript levels by regulating transcription, processing, and/or degradation through an integrated regulatory strategy. Here, we combine RNA metabolic labeling, rRNA-depleted RNA-seq, and DRiLL, a novel computational framework, to quantify the level; editing sites; and transcription, processing, and degradation rates of each transcript at a splice junction resolution during the LPS response of mouse dendritic cells. Four key regulatory strategies, dominated by RNA transcription changes, generate most temporal gene expression patterns. Noncanonical strategies that also employ dynamic posttranscriptional regulation control only a minority of genes, but provide unique signal processing features. We validate Tristetraprolin (TTP) as a major regulator of RNA degradation in one noncanonical strategy. Applying DRiLL to the regulation of noncoding RNAs and to zebrafish embryogenesis demonstrates its broad utility. Our study provides a new quantitative approach to discover transcriptional and posttranscriptional events that control dynamic changes in transcript levels using RNA sequencing data. Copyright © 2014 Elsevier Inc. All rights reserved.
    Cell 12/2014; 159(7). DOI:10.1016/j.cell.2014.11.015 · 33.12 Impact Factor

Full-text (2 Sources)

Available from
Jun 10, 2014