Article

Bovine ncRNAs Are Abundant, Primarily Intergenic, Conserved and Associated with Regulatory Genes

School of Molecular and Biomedical Science, The University of Adelaide, Adelaide, South Australia, Australia.
PLoS ONE (impact factor: 4.09). 08/2012; 7. DOI:10.1371/journal.pone.0042638
Source: PubMed

ABSTRACT It is apparent that non-coding transcripts are a common feature of higher organisms and encode uncharacterized layers of genetic regulation and information. We used public bovine EST data from many developmental stages and tissues, and developed a pipeline for the genome wide identification and annotation of non-coding RNAs (ncRNAs). We have predicted 23,060 bovine ncRNAs, 99% of which are un-annotated, based on known ncRNA databases. Intergenic transcripts accounted for the majority (57%) of the predicted ncRNAs and the occurrence of ncRNAs and genes were only moderately correlated (r = 0.55, p-value,2.2e-16). Many of these intergenic non-coding RNAs mapped close to the 39 or 59 end of thousands of genes and many of these were transcribed from the opposite strand with respect to the closest gene, particularly regulatory-related genes. Conservation analyses showed that these ncRNAs were evolutionarily conserved, and many intergenic ncRNAs proximate to genes contained sequence-specific motifs. Correlation analysis of expression between these intergenic ncRNAs and protein-coding genes using RNA-seq data from a variety of tissues showed significant correlations with many transcripts. These results support the hypothesis that ncRNAs are common, transcribed in a regulated fashion and have regulatory functions. Copyright: ß 2012 Qu, Adelson. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

0 0
 · 
0 Bookmarks
 · 
62 Views
  • Source
    Article: Tagging mammalian transcription complexity.
    [show abstract] [hide abstract]
    ABSTRACT: The nature of the 'transcriptome' is more complex than first realized. Although CAGE, various tagging technologies and tiling arrays show that most of the mammalian genome is transcribed, a large proportion of transcripts do not encode proteins and are either poorly polyadenylated, involved in sense-antisense pairs or never leave the nucleus. In this article, I review the various techniques and data sets that are currently used to measure gene transcription and the evidence that reveals the true extent of transcription in mammalian genomes. The next few years will see efforts to identify novel transcripts systematically and decipher their function. A deeper understanding of transcriptional complexity might even lead us to redefine what we mean by the term 'gene'.
    Trends in Genetics 10/2006; 22(9):501-10. · 10.06 Impact Factor
  • Source
    Article: The complexity of the mammalian transcriptome.
    [show abstract] [hide abstract]
    ABSTRACT: A comprehensive understanding of protein and regulatory networks is strictly dependent on the complete description of the transcriptome of cells. After the determination of the genome sequence of several mammalian species, gene identification is based on in silico predictions followed by evidence of transcription. Conservative estimates suggest that there are about 20,000 protein-encoding genes in the mammalian genome. In the last few years the combination of full-length cDNA cloning, cap-analysis gene expression (CAGE) tag sequencing and tiling arrays experiments have unveiled unexpected additional complexities in the transcriptome. Here we describe the current view of the mammalian transcriptome focusing on transcripts diversity, the growing non-coding RNA world, the organization of transcriptional units in the genome and promoter structures. In-depth analysis of the brain transcriptome has been challenging due to the cellular complexity of this organ. Here we present a computational analysis of CAGE data from different regions of the central nervous system, suggesting distinctive mechanisms of brain-specific transcription.
    The Journal of Physiology 10/2006; 575(Pt 2):321-32. · 4.72 Impact Factor
  • Article: The amazing complexity of the human transcriptome

Full-text (2 Sources)

View
38 Downloads
Available from
9 Aug 2012

Keywords

common feature
 
Correlation analysis
 
Creative Commons Attribution License
 
higher organisms
 
intergenic ncRNAs
 
intergenic ncRNAs proximate
 
intergenic non-coding RNAs mapped
 
Intergenic transcripts
 
ncRNA databases
 
non-coding RNAs
 
non-coding transcripts
 
open-access article
 
permits unrestricted use
 
predicted ncRNAs
 
public bovine EST data
 
regulated fashion
 
regulatory-related genes
 
results support
 
RNA-seq data
 
significant correlations