Rapid evolution of a recently retroposed transcription factor YY2 in mammalian genomes

Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA.
Genomics (Impact Factor: 2.28). 04/2006; 87(3):348-55. DOI: 10.1016/j.ygeno.2005.11.001
Source: PubMed


YY2 was originally identified due to its unusual similarity to the evolutionarily well-conserved zinc finger gene YY1. In this study, we have determined the evolutionary origin and conservation of YY2 using comparative genomic approaches. Our results indicate that YY2 is a retroposed copy of YY1 that has been inserted into another gene locus named Mbtps2 (membrane-bound transcription factor protease site 2). This retroposition is estimated to have occurred after the divergence of placental mammals from other vertebrates based on the detection of YY2 only in the placental mammals. The N- and C-terminal regions of YY2 have evolved under different selection pressures. The N-terminal region has evolved at a very fast pace with very limited functional constraints, whereas the DNA-binding, C-terminal region still maintains a sequence structure very similar to that of YY1 and is also well conserved among placental mammals. In situ hybridizations using different adult mouse tissues indicate that mouse YY2 is expressed at relatively low levels in Purkinje and granular cells of cerebellum and in neuronal cells of cerebrum, but at very high levels in testis. The expression levels of YY2 are much lower than those of YY1, but the overall spatial expression patterns are similar to those of Mbtps2, suggesting a possible shared transcriptional control between YY2 and Mbtps2. Taken together, the formation and evolution of YY2 represent a very unusual case where a transcription factor was first retroposed into another gene locus encoding a protease and survived with different selection schemes and expression patterns.

  • Source
    • "All three Yy1 family members are present during pre-implantation development [(13,71), R. Pérez and J. Schoorlemmer, unpublished data] and in the germ line (13,72). REX1 shares extensive homology in the DNA-binding zinc fingers with YY1 and YY2, which prompted us to test association of YY1 and YY2 to ERVs bound by REX1 (Figure 4). "
    ABSTRACT: Rex1/Zfp42 is a Yy1-related zinc-finger protein whose expression is frequently used to identify pluripotent stem cells. We show that depletion of Rex1 levels notably affected self-renewal of mouse embryonic stem (ES) cells in clonal assays, in the absence of evident differences in expression of marker genes for pluripotency or differentiation. By contrast, marked differences in expression of several endogenous retroviral elements (ERVs) were evident upon Rex1 depletion. We demonstrate association of REX1 with specific elements in chromatin-immunoprecipitation assays, most strongly with muERV-L and to a lesser extent with IAP and musD elements. Rex1 regulates muERV-L expression in vivo, as we show altered levels upon transient gain-and-loss of Rex1 function in pre-implantation embryos. We also find REX1 can associate with the lysine-demethylase LSD1/KDM1A, suggesting they act in concert. Similar to REX1 binding to retrotransposable elements (REs) in ES cells, we also detected binding of the REX1-related proteins YY1 and YY2 to REs, although the binding preferences of the two proteins were slightly different. Altogether, we show that Rex1 regulates ERV expression in mouse ES cells and during pre-implantation development and suggest that Rex1 and its relatives have evolved as regulators of endogenous retroviral transcription.
    Nucleic Acids Research 07/2012; 40(18):8993-9007. DOI:10.1093/nar/gks686 · 9.11 Impact Factor
  • Source
    • "The three categories enriched in proteins predicted to be under positive selection are all categories containing transcription factors. Individual transcription factors have been suggested to be under positive selection previously [37-39]. Furthermore, transcription factors have been shown to evolve faster than other protein classes in yeast [40]. "
    ABSTRACT: Understanding the adaptive changes that alter the function of proteins during evolution is an important question for biology and medicine. The increasing number of completely sequenced genomes from closely related organisms, as well as individuals within species, facilitates systematic detection of recent selection events by means of comparative genomics. We have used genome-wide strain-specific single nucleotide polymorphism data from 64 strains of budding yeast (Saccharomyces cerevisiae or Saccharomyces paradoxus) to determine whether adaptive positive selection is correlated with protein regions showing propensity for different classes of structure conformation. Data from phylogenetic and population genetic analysis of 3,746 gene alignments consistently shows a significantly higher degree of positive Darwinian selection in intrinsically disordered regions of proteins compared to regions of alpha helix, beta sheet or tertiary structure. Evidence of positive selection is significantly enriched in classes of proteins whose functions and molecular mechanisms can be coupled to adaptive processes and these classes tend to have a higher average content of intrinsically unstructured protein regions. We suggest that intrinsically disordered protein regions may be important for the production and maintenance of genetic variation with adaptive potential and that they may thus be of central significance for the evolvability of the organism or cell in which they occur.
    Genome biology 07/2011; 12(7):R65. DOI:10.1186/gb-2011-12-7-r65 · 10.81 Impact Factor
  • Source
    • "BR serine/threonine kinase 2 (SAD1), calcitonin gene-related peptide-receptor component protein (RCP9), calcitonin 1 (CALCA), cyclin-dependent kinase 2 (CDK2), coronin actin binding protein 1B (CORO1B), interleukin 10 (IL10), ninjurin 1 (NINJ1), oxidative-stress responsive 1 (OXSR1), phospholipase c eta2 (PLCH2), spectrin beta non-erythrocytic 1 (SPTBN1), plexin B2 (PLXNB2), synapsin III (SYN3), syntaxin 16 (STX16), lim domain and actin binding 1 (LIMA1), toll-like receptor 4 (TLR4), yy2 transcription factor (YY2), microtubule associated serine/threonine kinase 1 (MAST1), were some of the AD associated genes in the list of 300 zero TO genes between the ECnet and HIPnet, that were associated with processes such as protein transport, cytoskeletal organisation, neurotransmitter release etc. [33-37]. YY2 is highly similar to the evolutionarily well-conserved zinc finger gene YY1 [38], which activates beta-site amyloid precursor protein-cleaving enzyme 1 (BACE1) expression [39]. BACE1, which was present in the list of 271 genes between HIPnet and MTGnet, is necessary for the generation of beta-amyloid peptides, the principal constituents of senile plaques, in AD subjects [40,41]."
    ABSTRACT: Alzheimer's disease (AD) is a progressive neurodegenerative disorder involving variations in the transcriptome of many genes. AD does not affect all brain regions simultaneously. Identifying the differences among the affected regions may shed more light onto the disease progression. We developed a novel method involving the differential topology of gene coexpression networks to understand the association among affected regions and disease severity. We analysed microarray data of four regions--entorhinal cortex (EC), hippocampus (HIP), posterior cingulate cortex (PCC) and middle temporal gyrus (MTG) from AD affected and normal subjects. A coexpression network was built for each region and the topological overlap between them was examined. Genes with zero topological overlap between two region-specific networks were used to characterise the differences between the two regions. Results indicate that MTG shows early AD pathology compared to the other regions. We postulate that if the MTG gets affected later in the disease, post-mortem analyses of individuals with end-stage AD will show signs of early AD in the MTG, while the EC, HIP and PCC will have severe pathology. Such knowledge is useful for data collection in clinical studies where sample selection is a limiting factor as well as highlighting the underlying biology of disease progression.
    BMC Systems Biology 10/2010; 4(1):136. DOI:10.1186/1752-0509-4-136 · 2.44 Impact Factor