Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project The ENCODE Project Consortium Nature 2007 447 799 816

University of Lausanne, Lausanne, Vaud, Switzerland
Nature (Impact Factor: 41.46). 07/2007; 447(7146):799-816. DOI: 10.1038/nature05874
Source: PubMed


We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.

Download full-text


Available from: Matthew J Oberley, Aug 25, 2014
  • Source
    • "To date, the majority of biomarker efforts have focused on proteincoding genes, which comprise only a subset of all transcribed genes [2] [3]. Among the more than 90% of transcription that generates noncoding genes, long noncoding RNAs (lncRNAs) most closely resemble protein-coding genes in that they are transcribed by RNA polymerase II, polyadenylated, and associated with specific epigenetic signatures (i.e., H3K4me3 at the promoter and H3K36me3 throughout the gene length) [4] [5]. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Long noncoding RNAs (lncRNAs) are an emerging class of oncogenic molecules implicated in a diverse range of human malignancies. We recently identified SChLAP1 as a novel lncRNA that demonstrates outlier expression in a subset of prostate cancers, promotes tumor cell invasion and metastasis, and associates with lethal disease. Based on these findings, we sought to develop an RNA in situ hybridization (ISH) assay for SChLAP1 to 1) investigate the spectrum of SChLAP1 expression from benign prostatic tissue to metastatic castration-resistant prostate cancer and 2) to determine whether SChLAP1 expression by ISH is associated with outcome after radical prostatectomy in patients with clinically localized disease. The results from our current study demonstrate that SChLAP1 expression increases with prostate cancer progression, and high SChLAP1 expression by ISH is associated with poor outcome after radical prostatectomy in patients with clinically localized prostate cancer by both univariate (hazard ratio = 2.343, P = .005) and multivariate (hazard ratio = 1.99, P = .032) Cox regression analyses. This study highlights a potential clinical utility for SChLAP1 ISH as a novel tissue-based biomarker assay for outcome prognostication after radical prostatectomy.
    Full-text · Article · Dec 2014 · Neoplasia (New York, N.Y.)
  • Source
    • "Higher-density reference panels including more individuals from more diverse population groups will improve the utility of imputation in fine-mapping studies, providing more complete coverage of genetic variation across ethnicities without the need for re-sequencing. Advances in statistical method development, incorporating improved understanding of the genome from the ENCODE Project Consortium [42, 43], will augment causal variant localisation and provide further acumen as to the mechanisms through which GWAS loci influence T2D susceptibility, with the ultimate goal of translation of these findings into clinical practice and the resulting public health benefits. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Genome-wide association studies of type 2 diabetes have been extremely successful in discovering loci that contribute genetic effects to susceptibility to the disease. However, at the vast majority of these loci, the variants and transcripts through which these effects on type 2 diabetes are mediated are unknown, limiting progress in defining the pathophysiological basis of the disease. In this review, we will describe available approaches for assaying genetic variation across loci and discuss statistical methods to determine the most likely causal variants in the region. We will consider the utility of trans-ethnic meta-analysis for fine mapping by leveraging the differences in the structure of linkage disequilibrium between diverse populations. Finally, we will discuss progress in fine-mapping type 2 diabetes susceptibility loci to date and consider the prospects for future efforts to localise causal variants for the disease.
    Preview · Article · Nov 2014 · Current Diabetes Reports
  • Source
    • "For this definition, it somewhat arbitrary could not distinguish lncRNAs from small regulatory RNAs. Now, there have been identified far greater amounts of lncRNAs than protein coding genes [7-9]. A majority of annotated eukaryotic protein-coding ORFs were characterized with high level of phylogenetic diversity and the conservation. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Long non-coding RNAs (lncRNAs) are non-protein coding transcripts longer than 200 nucleotides. The post-transcriptional regulation is influenced by these lncRNAs by interfering with the microRNA pathways, involving in diverse cellular processes. The regulation of gene expression by lncRNAs at the epigenetic level, transcriptional and post-transcriptional level have been well known and widely studied. Recent recognition that lncRNAs make effects in many biological and pathological processes such as stem cell pluripotency, neurogenesis, oncogenesis and etc. This review will focus on the functional roles of lncRNAs in epigenetics and related research progress will be summarized.
    Full-text · Article · Sep 2014 · Biological Procedures Online
Show more