Article

Identification of human transcription factor binding sites by comparative genomics. M. Caselle 1 , D. Cor 1 , C. Herrmann 1 , C. Dieterich 2 , F. Di Cunto 3 and P. Provero 4

07/2003;
Source: CiteSeer

ABSTRACT Introduction Understanding transcriptional regulation of gene expression is one of the greatest challenges of modern molecular biology. A central role in this mechanism is played by transcription factors (TF) which typically bind to specific, short DNA sequence motifs which are usually located in the upstream region of the regulated genes. We discuss here a simple and powerful approach for the identification of these cis-regulatory motifs based on human-mouse genomic comparison. By using the catalogue of conserved upstream sequences collected in the CORG database [1] we construct sets of genes sharing the same overrepresented motif in their upstream regions both in human and in mouse. We perform this construction for all possible words from 5 to 8 nucleotides in length and then filter the resulting sets looking for two types of evidence for coregulation: first, we analyse the Gene Ontology annotation of the genes in the set looking for statistically significant common annotation; seco

0 0
 · 
0 Bookmarks
 · 
41 Views

Keywords

8 nucleotides
 
central role
 
cis-regulatory motifs
 
conserved upstream sequences
 
coregulation
 
gene expression
 
Gene Ontology annotation
 
genes sharing
 
greatest challenges
 
human-mouse genomic comparison
 
Introduction Understanding transcriptional regulation
 
modern molecular biology
 
overrepresented motif
 
powerful approach
 
resulting sets
 
short DNA sequence motifs
 
statistically significant common annotation
 
TF
 
upstream region
 
upstream regions