Transcription Factor Binding Profiles Reveal Cyclic Expression of Human Protein-coding Genes and Non-coding RNAs.

Institute for Quantitative Biomedical Sciences, Norris Cotton Cancer Center, Geisel School of Medicine at Dartmouth, Lebanon, New Hampshire, United States of America.
PLoS Computational Biology (Impact Factor: 4.83). 07/2013; 9(7):e1003132. DOI: 10.1371/journal.pcbi.1003132
Source: PubMed

ABSTRACT Cell cycle is a complex and highly supervised process that must proceed with regulatory precision to achieve successful cellular division. Despite the wide application, microarray time course experiments have several limitations in identifying cell cycle genes. We thus propose a computational model to predict human cell cycle genes based on transcription factor (TF) binding and regulatory motif information in their promoters. We utilize ENCODE ChIP-seq data and motif information as predictors to discriminate cell cycle against non-cell cycle genes. Our results show that both the trans- TF features and the cis- motif features are predictive of cell cycle genes, and a combination of the two types of features can further improve prediction accuracy. We apply our model to a complete list of GENCODE promoters to predict novel cell cycle driving promoters for both protein-coding genes and non-coding RNAs such as lincRNAs. We find that a similar percentage of lincRNAs are cell cycle regulated as protein-coding genes, suggesting the importance of non-coding RNAs in cell cycle division. The model we propose here provides not only a practical tool for identifying novel cell cycle genes with high accuracy, but also new insights on cell cycle regulation by TFs and cis-regulatory elements.

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: IntroductionGenetic and molecular signatures have been incorporated into cancer prognosis prediction and treatment decisions with good success over the past decade. Clinically, these signatures are usually used in early-stage cancers to evaluate whether they require adjuvant therapy following surgical resection. A molecular signature that is prognostic across more clinical contexts would be a useful addition to current signatures.Method We defined a signature for the ubiquitous tissue factor, E2F4, based on its shared target genes in multiple tissues. These target genes were identified by Chromatin Immunoprecipitation Sequencing (ChIP-seq) experiments using a probabilistic method. We then computationally calculated the regulatory activity score (RAS) of E2F4 in cancer tissues, and examined how E2F4 RAS correlates with patient survival.ResultsGenes in our E2F4 signature were 21-fold more likely to be correlated with breast cancer patient survival time compared to randomly selected genes. Using eight independent breast cancer datasets containing over 1,900 unique samples, we stratified patients into low and high E2F4 RAS groups. E2F4 activity stratification was highly predictive of patient outcome, and our results remained robust even when controlling for many factors including patient age, tumor size, grade, estrogen receptor (ER) status, lymph node status, whether the patient received adjuvant therapy, and the patient¿s other prognostic indices such as Adjuvant! and the Nottingham Prognostic Index scores. Furthermore, the fractions of samples with positive E2F4 RAS vary in different intrinsic breast cancer subtypes, consistent with the different survival profiles of these subtypes.Conclusion We defined a prognostic signature, the E2F4 regulatory activity score, and showed it to be significantly predictive of patient outcome in breast cancer regardless of treatment status and the states of many other clinicopathological variables. It can be used in conjunction with other breast cancer classification methods such as Oncotype DX to improve clinical outcome prediction.
    Breast cancer research: BCR 12/2014; 16(6):486. DOI:10.1186/s13058-014-0486-7 · 5.88 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Interpreting gene expression profiles often involves statistical analysis of large numbers of differentially expressed genes, isoforms, and alternative splicing events at either static or dynamic spectrums. Reduced sequencing costs have made feasible dense time-series analysis of gene expression using RNA-seq; however, statistical methods in the context of temporal RNA-seq data are poorly developed. Here we will review current methods for identifying temporal changes in gene expression using RNA-seq, which are limited to static pairwise comparisons of time points and which fail to account for temporal dependencies in gene expression patterns. We also review recently developed very few number of temporal dynamic RNA-seq specific methods. Application and development of RNA-specific temporal dynamic methods have been continuously under the development, yet, it is still in infancy. We fully cover microarray specific temporal methods and transcriptome studies in initial digital technology (e.g., SAGE) between traditional microarray and new RNA-seq.
    Frontiers in Genetics 02/2014; 5:35. DOI:10.3389/fgene.2014.00035
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Despite growing appreciation of the importance of epigenetics in breast cancer, our understanding of epigenetic alterations of non-coding RNAs (ncRNAs) in breast cancer remains limited. Here, we explored the epigenetic patterns of ncRNAs in breast cancers using published sequencing-based methylome data, primarily focusing on the two most commonly studied ncRNA biotypes, long ncRNAs and miRNAs. We observed widely aberrant methylation in the promoters of ncRNAs, and this abnormal methylation was more frequent than that in protein-coding genes. Specifically, intergenic ncRNAs were observed to comprise a majority (51.45% of the lncRNAs and 51.57% of the miRNAs) of the aberrantly methylated ncRNA promoters. Moreover, we summarized five patterns of aberrant ncRNA promoter methylation in the context of genomic CpG islands (CGIs), in which aberrant methylation occurred not only on CGIs, but also in regions flanking CGI and in CGI-lacking promoters. Integration with transcriptional datasets enabled us to determine that the ncRNA promoter methylation events were associated with transcriptional changes. Furthermore, a panel of ncRNAs were identified as biomarkers that discriminated between disease phenotypes. Finally, the potential functions of aberrantly methylated ncRNAs were predicted, suggestiong that ncRNAs and coding genes cooperatively mediate pathway dysregulation during the development and progression of breast cancer.
    Scientific Reports 03/2015; 5. DOI:10.1038/srep08790 · 5.08 Impact Factor

Full-text (2 Sources)

Available from
May 22, 2014