Article

RIPSeeker: a statistical package for identifying protein-associated transcripts from RIP-seq experiments.

Department of Computer Science, University of Toronto, Toronto, Ontario, M5S 2E4, Canada The Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada, Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A4, Canada and Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada.
Nucleic Acids Research (Impact Factor: 8.81). 02/2013; DOI: 10.1093/nar/gkt142
Source: PubMed

ABSTRACT RIP-seq has recently been developed to discover genome-wide RNA transcripts that interact with a protein or protein complex. RIP-seq is similar to both RNA-seq and ChIP-seq, but presents unique properties and challenges. Currently, no statistical tool is dedicated to RIP-seq analysis. We developed RIPSeeker (http://www.bioconductor.org/packages/2.12/bioc/html/RIPSeeker.html), a free open-source Bioconductor/R package for de novo RIP peak predictions based on HMM. To demonstrate the utility of the software package, we applied RIPSeeker and six other published programs to three independent RIP-seq datasets and two PAR-CLIP datasets corresponding to six distinct RNA-binding proteins. Based on receiver operating curves, RIPSeeker demonstrates superior sensitivity and specificity in discriminating high-confidence peaks that are consistently agreed on among a majority of the comparison methods, and dominated 9 of the 12 evaluations, averaging 80% area under the curve. The peaks from RIPSeeker are further confirmed based on their significant enrichment for biologically meaningful genomic elements, published sequence motifs and association with canonical transcripts known to interact with the proteins examined. While RIPSeeker is specifically tailored for RIP-seq data analysis, it also provides a suite of bioinformatics tools integrated within a self-contained software package comprehensively addressing issues ranging from post-alignments' processing to visualization and annotation.

0 Followers
 · 
211 Views
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Methylated RNA Immunoprecipatation combined with RNA sequencing (MeRIP-seq) is revolutionizing the de novo study of RNA epigenomics at a higher resolution. However, this new technology poses unique bioinformatics problems that call for novel and sophisticated statistical computational solutions, aiming at identifying and characterizing transcriptome-wide methyltranscriptome. We developed HEP, a Hidden Markov Model (HMM)-based Exome Peak-finding algorithm for predicting transcriptome methylation sites using MeRIP-seq data. In contrast to exomePeak, our previously developed MeRIP-seq peak calling algorithm, HEPeak models the correlation between continuous bins in an m6A peak region and it is a model-based approach, which admits rigorous statistical inference. HEPeak was evaluated on a simulated MeRIP-seq dataset and achieved higher sensitivity and specificity than exomePeak. HEPeak was also applied to real MeRIP-seq datasets from human HEK293T cell line and mouse midbrain cells and was shown to be able to recapitulate known m6A distribution in transcripts and identify novel m6A sites in long non-coding RNAs. In this paper, a novel HMM-based peak calling algorithm, HEPeak, was developed for peak calling for MeRIP-seq data. HEPeak is written in R and is publicly available.
    BMC Genomics 04/2015; 16(Suppl 4):S2. DOI:10.1186/1471-2164-16-S4-S2 · 4.04 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: In recent year, increasing evidence suggests that noncoding RNAs play important roles in the regulation of tissue homeostasis and pathophysiological conditions. Besides small noncoding RNAs (eg, microRNAs), >200-nucleotide long transcripts, namely long noncoding RNAs (lncRNAs), can interfere with gene expressions and signaling pathways at various stages. In the cardiovascular system, studies have detected and characterized the expression of lncRNAs under normal physiological condition and in disease states. Several lncRNAs are regulated during acute myocardial infarction (eg, Novlnc6) and heart failure (eg, Mhrt), whereas others control hypertrophy, mitochondrial function and apoptosis of cardiomyocytes. In the vascular system, the endothelial-expressed lncRNAs (eg, MALAT1 and Tie-1-AS) can regulate vessel growth and function, whereas the smooth-muscle-expressed lncRNA smooth muscle and endothelial cell-enriched migration/differentiation-associated long noncoding RNA was recently shown to control the contractile phenotype of smooth muscle cells. This review article summarizes the data on lncRNA expressions in mouse and human and highlights identified cardiovascular lncRNAs that might play a role in cardiovascular diseases. Although our understanding of lncRNAs is still in its infancy, these examples may provide helpful insights how lncRNAs interfere with cardiovascular diseases. © 2015 American Heart Association, Inc.
    Circulation Research 02/2015; 116(4):737-750. DOI:10.1161/CIRCRESAHA.116.302521 · 11.09 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The pervasive transcription of the genome creates many types of non-coding RNAs (ncRNAs). However, we know very little regarding the functions and the regulatory mechanisms of these ncRNAs. Exploring the interactions of RNA and RNA binding proteins (RBPs) is vital because it can allow us to truly understand how these ncRNAs behave in vivo. High-throughput sequencing of RNA isolated by cross-linking immunoprecipitation (HITS-CLIP or CLIP-seq) and its variants have been successfully used as systemic techniques to study RBP binding sites. In this review, we will explain the major differences between the CLIP techniques, summarize successful applications of these techniques, discuss limitations of CLIP, present some suggested solutions and project their promising future roles in studying the RNA world.
    Science China. Life sciences 01/2015; 58(1):75-88. DOI:10.1007/s11427-014-4764-5 · 1.51 Impact Factor

Similar Publications