Characterization of constitutive CTCF/cohesin loci: A possible role in establishing topological domains in mammalian genomes

BMC Genomics (Impact Factor: 3.99). 08/2013; 14(1):553. DOI: 10.1186/1471-2164-14-553
Source: PubMed


Recent studies suggested that human/mammalian genomes are divided into large, discrete domains that are units of chromosome organization. CTCF, a CCCTC binding factor, has a diverse role in genome regulation including transcriptional regulation, chromosome-boundary insulation, DNA replication, and chromatin packaging. It remains unclear whether a subset of CTCF binding sites plays a functional role in establishing/maintaining chromatin topological domains.
We systematically analysed the genomic, transcriptomic and epigenetic profiles of the CTCF binding sites in 56 human cell lines from ENCODE. We identified ~24,000 CTCF sites (referred to as constitutive sites) that were bound in more than 90% of the cell lines. Our analysis revealed: 1) constitutive CTCF loci were located in constitutive open chromatin and often co-localized with constitutive cohesin loci; 2) most constitutive CTCF loci were distant from transcription start sites and lacked CpG islands but were enriched with the full-spectrum CTCF motifs: a recently reported 33/34-mer and two other potentially novel (22/26-mer); 3) more importantly, most constitutive CTCF loci were present in CTCF-mediated chromatin interactions detected by ChIA-PET and these pair-wise interactions occurred predominantly within, but not between, topological domains identified by Hi-C.
Our results suggest that the constitutive CTCF sites may play a role in organizing/maintaining the recently identified topological domains that are common across most human cells.

Download full-text


Available from: Weichun Huang, Aug 01, 2014
  • Source
    • "B. All 12 proteins comprising the classical multi-subunit NurD chromatin remodeling complex represent protein partners of both NANOG-centered and POU5F1-centered protein-protein interaction networks in hESC (van den Berg et al., 2011; Gagliardi et al., 2013). Genes encoding 10 of 12 (83%) subunits of the classical multisubunit NurD chromatin remodeling complex are located near human-specific NANOG-binding sites in the hESC genome. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Genome-wide proximity placement analysis of 10,598 HSGRL within the context of the principal regulatory structures of the interphase chromatin, namely topologically-associating domains (TADs) and specific sub-TAD structures termed super-enhancer domains (SEDs) revealed that 0.8%-10.3% of TADs contain more than half of HSGRL. Of the 3,127 TADs in the hESC genome, 24 (0.8%); 53 (1.7%); 259 (8.3%); and 322 (10.3%) harbor 1,110 (52.4%); 1,936 (50.9%); 1,151 (59.6%); and 1,601 (58.3%) HSGRL sequences from four distinct families, respectively. TADs that are enriched for HSGRL and termed rapidly-evolving in humans TADs (revTADs) manifest distinct correlation patterns between HSGRL placements and recombination rates. There are significant enrichment within revTAD boundaries of hESC-enhancers, primate-specific CTCF-binding sites, human-specific RNAPII-binding sites, hCONDELs, and H3K4me3 peaks with human-specific enrichment at TSS in prefrontal cortex neurons (p < 0.0001 in all instances). In hESC genome, 331 of 504 (66%) of SE-harboring TADs contain HSGRL and 68% of SEs co-localize with HSGRL, suggesting that HSGRL rewired SE-driven GRNs within revTADs by inserting novel and/or erasing existing regulatory sequences. Consequently, markedly distinct features of chromatin structures evolved in hESC compared to mouse: the SE quantity is 3-fold higher and the median SE size is significantly larger; concomitantly, the TAD number is increased by 42% while the median TAD size is decreased (p=9.11E-37). Present analyses revealed a global role for HSGRL in increasing both quantity and size of SEs and increasing the number and size reduction of TADs, which may facilitate a convergence of TAD and SED architectures of interphase chromatin and define a trend of increasing regulatory complexity during evolution of GRNs.
    • "H2AZ is known to be associated with nucleosome exchange and remodeling [13] [23] [108] [109], it thus likely contributes to the highly dynamic properties of pluripotent chromatin and its refractory character to HP1-associated constitutive heterochromatin extension [23] [27] [99] [101] [110]. This interpretation was further strengthened by the observation that unlike C4, EC4 is enriched in CTCF which besides its insulator properties [102] [103], is also known to mediate long-range intra-and inter-chromosomal interactions [111] [112] [113] [114] [115] [116]. The fact that H2AZ was also found to be broadly distributed in the bivalent state EC2 containing bivalent genes confirmed that the polycomb repressed state C2 resulted from the spreading of H3K27me3 in differentiated cells [23] [27] [99] [101] [110]. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Recent analysis of genome-wide epigenetic modification data, mean replication timing (MRT) profiles and chromosome conformation data in mammals have provided increasing evidence that flexibility in replication origin usage is regulated locally by the epigenetic landscape and over larger genomic distances by the 3D chromatin architecture. Here, we review the recent results establishing some link between replication domains and chromatin structural domains in pluripotent and various differentiated cell types in human. We reconcile the originally proposed dichotomic picture of early and late constant timing regions that replicate by multiple rather synchronous origins in separated nuclear compartments of open and closed chromatins, with the U-shaped MRT domains bordered by "master" replication origins specified by a localized (∼200-300kb) zone of open and transcriptionally active chromatin from which a replication wave likely initiates and propagates toward the domain center via a cascade of origin firing. We discuss the relationships between these MRT domains, topologically associated domains and lamina-associated domains. This review sheds a new light on the epigenetically regulated global chromatin reorganization that underlies the loss of pluripotency and the determination of differentiation properties. Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
    FEBS letters 04/2015; 589(20). DOI:10.1016/j.febslet.2015.04.015 · 3.17 Impact Factor
  • Source
    • "This protein contains the N-terminal self-association domain that forms trimers (Cross et al., 2010), and its C-terminal domain is involved in the interaction with LMO2. The multiprotein complexes containing GATA1, TAL1, E2A, LMO2, and LDB1 proteins (named Ldb1 complexes) bind to a conserved paired motif composed of a consensus E-box and a GATA motif (Figure 5A) with restricted orientation and spacing, CANNTG-N8-10-GATA (Cheng et al., 2009; Soler et al., 2010; Li et al., 2013). "
    [Show abstract] [Hide abstract]
    ABSTRACT: Due to advances in genome-wide technologies, consistent distant interactions within chromosomes of higher eukaryotes have been revealed. In particular, it has been shown that enhancers can specifically and directly interact with promoters by looping out intervening sequences, which can be up to several hundred kilobases long. This review is focused on transcription factors that are supposed to be involved in long-range interactions. Available data are in agreement with the model that several known transcription factors and insulator proteins belong to an abundant but poorly studied class of proteins that are responsible for chromosomal architecture.
    Frontiers in Genetics 02/2014; 5:28. DOI:10.3389/fgene.2014.00028
Show more