Genome-Wide Mapping of in Vivo Protein-DNA Interactions

Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305-5120, USA.
Science (Impact Factor: 33.61). 07/2007; 316(5830):1497-502. DOI: 10.1126/science.1141319
Source: PubMed

ABSTRACT

In vivo protein-DNA interactions connect each transcription factor with its direct targets to form a gene network scaffold.
To map these protein-DNA interactions comprehensively across entire mammalian genomes, we developed a large-scale chromatin
immunoprecipitation assay (ChIPSeq) based on direct ultrahigh-throughput DNA sequencing. This sequence census method was then
used to map in vivo binding of the neuron-restrictive silencer factor (NRSF; also known as REST, for repressor element–1 silencing
transcription factor) to 1946 locations in the human genome. The data display sharp resolution of binding position [±50 base
pairs (bp)], which facilitated our finding motifs and allowed us to identify noncanonical NRSF-binding motifs. These ChIPSeq
data also have high sensitivity and specificity [ROC (receiver operator characteristic) area ≥ 0.96] and statistical confidence
(P <10–4), properties that were important for inferring new candidate interactions. These include key transcription factors in the
gene network that regulates pancreatic islet cell development.

Download full-text

Full-text

Available from: Richard M Myers
  • Source
    • "Thus, the broader picture that has emerged is that the stress response transcription factor pathways are not isolated regulatory units but rather cooperate and co-opt each other to modify the cell's functions in a complex manner. High-throughput sequencing tools have become widespread in gene expression studies (Metzker, 2010; Wang, Gerstein & Snyder, 2009; Johnson et al., 2007). For example, RNAseq has become a powerful tool for analyzing differential gene expression by quantifying the RNA abundance of the transcriptome. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Here we present a genome-wide method for de novo identification of enhancer regions. This approach enables massively parallel empirical investigation of DNA sequences that mediate transcriptional activation and provides a platform for discovery of regulatory modules capable of driving context-specific gene expression. The method links fragmented genomic DNA to the transcription of randomer molecule identifiers and measures the functional enhancer activity of the library by massively parallel sequencing. We transfected a Drosophila melanogaster library into S2 cells in normoxia and hypoxia, and assayed 4,599,881 genomic DNA fragments in parallel. The locations of the enhancer regions strongly correlate with genes up-regulated after hypoxia and previously described enhancers. Novel enhancer regions were identified and integrated with RNAseq data and transcription factor motifs to describe the hypoxic response on a genome-wide basis as a complex regulatory network involving multiple stress-response pathways. This work provides a novel method for high-throughput assay of enhancer activity and the genome-scale identification of 31 hypoxia-activated enhancers in Drosophila .
    Full-text · Article · Dec 2015 · PeerJ
  • Source
    • "For example, the technology of Chromatin immunoprecipitation (ChIP) followed by sequencing (ChIP-Seq) can be applied to quantify the genome-wide binding occupancy of those protein-DNA interactions. [11], [12]. "
    [Show abstract] [Hide abstract]
    ABSTRACT: With the prevalence of chromatin immunoprecipitation (ChIP) with sequencing (ChIP-Seq) technology, massive ChIP-Seq data has been accumulated. The ChIP-Seq technology measures the genome-wide occupancy of DNA-binding proteins in vivo. It is well-known that different DNA-binding protein occupancies may result in a gene being regulated in different conditions (e.g. different cell types). To fully understand a gene's function, it is essential to develop probabilistic models on multiple ChIP-Seq profiles for deciphering the gene transcription causalities. In this work, we propose and describe two probabilistic models. Assuming the conditional independence of different DNA-binding proteins' occupancies, the first method (SignalRanker) is developed as an intuitive method for ChIP-Seq genome-wide signal profile inference. Unfortunately, such an assumption may not always hold in some gene regulation cases. Thus, we propose and describe another method (FullSignalRanker) which does not make the conditional independence assumption. The proposed methods are compared with other existing methods on ENCODE ChIP-Seq datasets, demonstrating its regression and classification ability. The results suggest that FullSignalRanker is the best-performing method for recovering the signal ranks on the promoter and enhancer regions. In addition, FullSignalRanker is also the best-performing method for peak sequence classification. We envision that SignalRanker and FullSignalRanker will become important in the era of next generation sequencing. FullSignalRanker program is available on the following website: http://www.cs.toronto.edu/ ∼ wkc/FullSignalRanker/.
    Full-text · Article · Dec 2015 · IEEE/ACM Transactions on Computational Biology and Bioinformatics
  • Source
    • "All TFs recognize short DNA sequences (6– 12 bp) and their motifs show varying levels of degeneracy (D'haeseleer 2006). ChIP-seq experiments allow the identification of regions in the genome that are bound by TFs (Johnson et al. 2007). Analyses of a wide variety of different TFs have shown that only a small subset of possible binding sites is occupied by the TF (Carr and Biggin 1999; Iyer et al. 2001; Yang et al. 2006). "
    [Show abstract] [Hide abstract]
    ABSTRACT: Macrophages populate every tissue of the body and play vital roles in homeostasis, pathogen elimination, and tissue healing. These cells possess the ability to adapt to a multitude of abruptly changing and complex environments. Furthermore, different populations of resident tissue macrophages each show their own defining gene signatures. The enhancer repertoire of these cells underlies both the cellular identity of a given subset of resident macrophage population and their ability to dynamically alter, in an efficient manner, their gene expression programs in response to internal and external signals. Notably, transcription is pervasive at active enhancers and enhancer RNAs, or eRNAs, are tightly correlated to regulated transcription of protein-coding genes. Furthermore, selection and establishment of enhancers is a dynamic and plastic process in which activation of intracellular signaling pathways by factors present in a macrophage's environment play a determining role. Here, we review recent studies providing insights into the distinct mechanisms that contribute to the selection and function of enhancers in macrophages and the relevance of studying these mechanisms to gain a better understanding of complex human diseases.
    Preview · Article · Nov 2015 · Cold Spring Harbor Symposia on Quantitative Biology
Show more