Word-based characterization of promoters involved in human DNA repair pathways

Bioinformatics Laboratory, School of Electrical Engineering and Computer Science, Ohio University, Athens, Ohio, USA.
BMC Genomics (Impact Factor: 3.99). 02/2009; 10 Suppl 1(Suppl 1):S18. DOI: 10.1186/1471-2164-10-S1-S18
Source: PubMed


DNA repair genes provide an important contribution towards the surveillance and repair of DNA damage. These genes produce a large network of interacting proteins whose mRNA expression is likely to be regulated by similar regulatory factors. Full characterization of promoters of DNA repair genes and the similarities among them will more fully elucidate the regulatory networks that activate or inhibit their expression. To address this goal, the authors introduce a technique to find regulatory genomic signatures, which represents a specific application of the genomic signature methodology to classify DNA sequences as putative functional elements within a single organism.
The effectiveness of the regulatory genomic signatures is demonstrated via analysis of promoter sequences for genes in DNA repair pathways of humans. The promoters are divided into two classes, the bidirectional promoters and the unidirectional promoters, and distinct genomic signatures are calculated for each class. The genomic signatures include statistically overrepresented words, word clusters, and co-occurring words. The robustness of this method is confirmed by the ability to identify sequences that exist as motifs in TRANSFAC and JASPAR databases, and in overlap with verified binding sites in this set of promoter regions.
The word-based signatures are shown to be effective by finding occurrences of known regulatory sites. Moreover, the signatures of the bidirectional and unidirectional promoters of human DNA repair pathways are clearly distinct, exhibiting virtually no overlap. In addition to providing an effective characterization method for related DNA sequences, the signatures elucidate putative regulatory aspects of DNA repair pathways, which are notably under-characterized.

Download full-text


Available from: Jens Lichtenberg, Sep 02, 2014
  • Source
    • "A strong relation to stress responses was strongly suggested based on the GO clusters generated for the complete gene list (Table 1(j)). A detailed analysis of the promoter regions of the clustered genes identified regulatory genomic signatures [14] [15], i.e., putative cis -regulatory elements and modules associated with gravitropic control of transcription. Genes which have similar expression patterns typically share the same regulatory element (word) in their promoter regions. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Gravity is a common stimulus affecting plant growth and develop-ment, from seed germination to positioning of flowers for pollina-tion and seeds for dispersal. Classic models of plant gravitropism have revolved around biophysical perception of the gravity stimulus and the effects of plant growth regulators on the growth response. Transcriptional regulation of the gravitropic mechanism has been largely ignored. The aim of this experiment is to identify putative regulatory functional elements, including transcription factor bind-ing sites and cis -regulatory modules involved in gravitropic signal transduction. In this article, we detailed a strategy to identify putative cis -regulatory elements by analyzing gene expression data from mi-croarray experiments. Genes involved in the gravitropic perception– response pathway were identified based on their changes in ex-pression level after gravity stimulation. Genes were clustered ac-cording to their expression patterns (transcriptional regulation pro-files), and gene promoter were analyzed using genomics regulatory analysis software to identify candidate cis -regulatory elements and cis -regulatory modules. Analysis of the microarray data indicated that 154 genes were involved in the gravitropic response. The genes were grouped into 9 clusters based on expression profile similarities. An analysis of the promoters of the 154 genes resulted in the identification of 32 putative regulatory elements and 55 putative regulatory modules. Some of the elements are associated with individual clusters and other elements are associated with multiple clusters, potentially indicating elements involved in specific and in general gravitropic response processes, respectively.
    Full-text · Article · Aug 2010
  • Source

    Preview · Article · Jan 2010
  • [Show abstract] [Hide abstract]
    ABSTRACT: Encyclopedias of regulatory genomic elements provide a foundation for research in areas such as disease diagnosis, disease treatment, and crop enhancement. The construction of complete encyclopedias of organism-specific genomic elements involved in gene regulation remains a significant challenge. To address this problem, the authors present novel bioinformatics strategies for exploring the word landscapes of putative regulatory regions of genomes. The methods are incorporated into the WordSeeker software tool, which is available at The effectiveness of these strategies is demonstrated through several case studies.
    No preview · Article · Jun 2009
Show more