Variation in Homeodomain DNA Binding Revealed by High-Resolution Analysis of Sequence Preferences

Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA.
Cell (Impact Factor: 32.24). 07/2008; 133(7):1266-76. DOI: 10.1016/j.cell.2008.05.024
Source: PubMed


Most homeodomains are unique within a genome, yet many are highly conserved across vast evolutionary distances, implying strong selection on their precise DNA-binding specificities. We determined the binding preferences of the majority (168) of mouse homeodomains to all possible 8-base sequences, revealing rich and complex patterns of sequence specificity and showing that there are at least 65 distinct homeodomain DNA-binding activities. We developed a computational system that successfully predicts binding sites for homeodomain proteins as distant from mouse as Drosophila and C. elegans, and we infer full 8-mer binding profiles for the majority of known animal homeodomains. Our results provide an unprecedented level of resolution in the analysis of this simple domain structure and suggest that variation in sequence recognition may be a factor in its functional diversity and evolutionary success.

Download full-text


Available from: Olga Borisovna Botvinnik
  • Source
    • "Even though NKX2-5Y191C cannot bind DNA, the most overrepresented motifs were 5′AAGTGT3′ (NKE), 5′GATAA3′ (GATA), and 5′TGCCAA3′ (NF1-like), exactly as for NKX2-5 WT fusion (Supplementary file 1). NKX2-5Y191C peaks also showed overrepresentation of the motif 5′TAATC3′, which is similar to the binding sites of many non-NK-2 class HD proteins, including HOX proteins, as well as that of NK-2 proteins that lack HD tyrosine 54, such as NKX1-2 (Berger et al., 2008). NK-2 class proteins which do carry Y54 in their HDs, including NKX2-5, also bind to this HOX-like site, albeit with a 10-fold-reduced affinity compared to that of the NKE (Chen and Schwartz, 1995). "
    [Show abstract] [Hide abstract]
    ABSTRACT: To model cardiac gene regulatory networks in health and disease we used DamID to establish robust target gene sets for the cardiac homeodomain factor NKX2-5 and two congenital heart disease-associated mutants carrying a crippled homeodomain, which normally functions as DNA- and protein-binding interface. Despite compromised direct DNA-binding, NKX2-5 mutants retained partial functionality and bound hundreds of targets, including NKX2-5 wild type targets and unique sets of 'off-targets'. NKX2-5∆HD, which lacks the entire homeodomain, could still dimerise with wild type NKX2-5 and its cofactors, including newly-discovered cofactors of the ETS family, through the conserved tyrosine-rich domain (YRD). NKX2-5∆HD off-targets showed overrepresentation of many binding motifs, including ETS motifs, the majority co-occupied by ETS proteins as determined by DamID. Off-targets of an NKX2-5 YRD mutant were not enriched in ETS targets. Our study reveals off-target binding and transcriptional activity for NKX2-5 mutations driven in part by cofactor interactions, suggesting a novel type of gain-of-function in congenital heart disease.
    Full-text · Article · Jul 2015 · eLife Sciences
  • Source
    • "Trawler integrated the sequence data to identify overrepresented DNA sequence motifs (from 6–20 nucleotides) among the cardiac-expressed gene promoter regions, in comparison to the intergenic sequence of noncardiac " background " receptors. These motifs were then compared with known transcription factor binding sites (TFBS) from a collection of databases, including Jaspar (Bryne et al. 2008), TRANSFAC (Knuppel et al. 1994), as well as from a large set of homeodomain protein recognition sites (Berger et al. 2008). An up-to-date, curated collection of these databases is available and searchable using STAMP (Mahony and Benos 2007). "
    [Show abstract] [Hide abstract]
    ABSTRACT: G protein-coupled receptors are the principal mediators of the sweet, umami, bitter, and fat taste qualities in mammals. Intriguingly, the taste receptors are also expressed outside of the oral cavity, including in the gut, airways, brain, and heart, where they have additional functions and contribute to disease. However, there is little known about the mechanisms governing the transcriptional regulation of taste receptor genes. Following our recent delineation of taste receptors in the heart, we investigated the genomic loci encoding for taste receptors to gain insight into the regulatory mechanisms that drive their expression in the heart. Gene expression analyses of healthy and diseased human and mouse hearts showed coordinated expression for a subset of chromosomally clustered taste receptors. This chromosomal clustering mirrored the cardiac expression profile, suggesting that a common gene regulatory block may control the taste receptor locus. We identified unique domains with strong regulatory potential in the vicinity of taste receptor genes. We also performed de novo motif enrichment in the proximal promoter regions and found several overrepresented DNA motifs in cardiac taste receptor gene promoters corresponding to ubiquitous and cardiac-specific transcription factor binding sites. Thus, combining cardiac gene expression data with bioinformatic analyses, this study has provided insights into the noncoding regulatory landscape for taste GPCRs. These findings also have broader relevance for the study of taste GPCRs outside of the classical gustatory system, where understanding the mechanisms controlling the expression of these receptors may have implications for future therapeutic development.
    Full-text · Article · May 2015 · Archiv für Experimentelle Pathologie und Pharmakologie
  • Source
    • "These differentially expressed genes represent both direct and indirect targets of Rx3. The presence of RAX-binding motifs as defined by a position weight matrix in the de-regulated genes was shown to further support the contention that some of the genetic network is directly regulated by Rx3 [8, 56]. The core of the RAX binding motif is a short sequence “TAATTA”, thus it is relatively common to find this sequence in the zebrafish genome (about 1 in every 4 kb). "
    [Show abstract] [Hide abstract]
    ABSTRACT: Background The genetic cascades underpinning vertebrate early eye morphogenesis are poorly understood. One gene family essential for eye morphogenesis encodes the retinal homeobox (Rx) transcription factors. Mutations in the human retinal homeobox gene (RAX) can lead to gross morphological phenotypes ranging from microphthalmia to anophthalmia. Zebrafish rx3 null mutants produce a similar striking eyeless phenotype with an associated expanded forebrain. Thus, we used zebrafish rx3-/- mutants as a model to uncover an Rx3-regulated gene network during early eye morphogenesis. Results Rx3-regulated genes were identified using whole transcriptomic sequencing (RNA-seq) of rx3-/- mutants and morphologically wild-type siblings during optic vesicle morphogenesis. A gene co-expression network was then constructed for the Rx3-regulated genes, identifying gene cross-talk during early eye development. Genes highly connected in the network are hub genes, which tend to exhibit higher expression changes between rx3-/- mutants and normal phenotype siblings. Hub genes down-regulated in rx3-/- mutants encompass homeodomain transcription factors and mediators of retinoid-signaling, both associated with eye development and known human eye disorders. In contrast, genes up-regulated in rx3-/- mutants are centered on Wnt signaling pathways, associated with brain development and disorders. The temporal expression pattern of Rx3-regulated genes was further profiled during early development from maternal stage until visual function is fully mature. Rx3-regulated genes exhibited synchronized expression patterns, and a transition of gene expression during the early segmentation stage when Rx3 was highly expressed. Furthermore, most of these deregulated genes are enriched with multiple RAX-binding motif sequences on the gene promoter. Conclusions Here, we assembled a comprehensive model of Rx3-regulated genes during early eye morphogenesis. Rx3 promotes optic vesicle morphogenesis and represses brain development through a highly correlated and modulated network, exhibiting repression of genes mediating Wnt signaling and concomitant enhanced expression of homeodomain transcription factors and retinoid-signaling genes. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-825) contains supplementary material, which is available to authorized users.
    Full-text · Article · Sep 2014 · BMC Genomics
Show more