Correction: a novel bayesian DNA motif comparison method for clustering and retrieval.

PLoS Computational Biology (Impact Factor: 4.83). 05/2011; 7(5). DOI: 10.1371/annotation/d876137b-59c5-48cf-8491-c8cf12f26a9b
Source: PubMed

ABSTRACT [This corrects the article on p. e1000010 in vol. 4.].

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Sequence-specific DNA recognition by gene regulatory proteins is critical for proper cellular functioning. The ability to predict the DNA binding preferences of these regulatory proteins from their amino acid sequence would greatly aid in reconstruction of their regulatory interactions. Structural modeling provides one route to such predictions: by building accurate molecular models of regulatory proteins in complex with candidate binding sites, and estimating their relative binding affinities for these sites using a suitable potential function, it should be possible to construct DNA binding profiles. Here, we present a novel molecular modeling protocol for protein-DNA interfaces that borrows conformational sampling techniques from de novo protein structure prediction to generate a diverse ensemble of structural models from small fragments of related and unrelated protein-DNA complexes. The extensive conformational sampling is coupled with sequence space exploration so that binding preferences for the target protein can be inferred from the resulting optimized DNA sequences. We apply the algorithm to predict binding profiles for a benchmark set of eleven C2H2 zinc finger transcription factors, five of known and six of unknown structure. The predicted profiles are in good agreement with experimental binding data; furthermore, examination of the modeled structures gives insight into observed binding preferences.
    Nucleic Acids Research 02/2011; 39(11):4564-76. · 8.81 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Many important cellular protein interactions are mediated by peptide recognition domains. The ability to predict a domain's binding specificity directly from its primary sequence is essential to understanding the complexity of protein-protein interaction networks. One such recognition domain is the PDZ domain, functioning in scaffold proteins that facilitate formation of signaling networks. Predicting the PDZ domain's binding specificity was a part of the DREAM4 Peptide Recognition Domain challenge, the goal of which was to describe, as position weight matrices, the specificity profiles of five multi-mutant ERBB2IP-1 domains. We developed a method that derives multi-mutant binding preferences by generalizing the effects of single point mutations on the wild type domain's binding specificities. Our approach, trained on publicly available ERBB2IP-1 single-mutant phage display data, combined linear regression-based prediction for ligand positions whose specificity is determined by few PDZ positions, and single-mutant position weight matrix averaging for all other ligand columns. The success of our method as the winning entry of the DREAM4 competition, as well as its superior performance over a general PDZ-ligand binding model, demonstrates the advantages of training a model on a well-selected domain-specific data set.
    PLoS ONE 09/2010; 5(9):e12787. · 3.53 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The Yin Yang 1 (YY1) transcription factor is a master regulator of development, essential for early embryogenesis and adult tissues formation. YY1 is the mammalian orthologue of Pleiohomeotic, one of the transcription factors that binds Polycomb DNA response elements in Drosophila melanogaster and mediates Polycomb group proteins (PcG) recruitment to DNA. Despite several publications pointing at YY1 having a similar role in mammalians, others showed features of YY1 that are not compatible with PcG functions. Here, we show that, in mouse Embryonic Stem (ES) cells, YY1 has genome-wide PcG-independent activities while it is still stably associated with the INO80 chromatin-remodeling complex, as well as with novel RNA helicase activities. YY1 binds chromatin in close proximity of the transcription start site of highly expressed genes. Loss of YY1 functions preferentially led to a down-regulation of target genes expression, as well as to an up-regulation of several small non-coding RNAs, suggesting a role for YY1 in regulating small RNA biogenesis. Finally, we found that YY1 is a novel player of Myc-related transcription factors and that its coordinated binding at promoters potentiates gene expression, proposing YY1 as an active component of the Myc transcription network that links ES to cancer cells.
    Nucleic Acids Research 12/2011; 40(8):3403-18. · 8.81 Impact Factor


1 Download
Available from