Locating mammalian transcription factor binding sites: a survey of computational and experimental techniques

Genomic Functional Analysis Section, National Human Genome Research Institute, National Institutes of Health, Rockville, Maryland 20878, USA.
Genome Research (Impact Factor: 13.85). 01/2007; 16(12):1455-64. DOI: 10.1101/gr.4140006
Source: PubMed

ABSTRACT Fields such as genomics and systems biology are built on the synergism between computational and experimental techniques. This type of synergism is especially important in accomplishing goals like identifying all functional transcription factor binding sites in vertebrate genomes. Precise detection of these elements is a prerequisite to deciphering the complex regulatory networks that direct tissue specific and lineage specific patterns of gene expression. This review summarizes approaches for in silico, in vitro, and in vivo identification of transcription factor binding sites. A variety of techniques useful for localized- and high-throughput analyses are discussed here, with emphasis on aspects of data generation and verification.

Download full-text


Available from: Peggy J Farnham, Dec 18, 2013
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Functional annotation of the genome is important to understand the phenotypic complexity of various species. The road toward functional annotation involves several challenges ranging from experiments on individual molecules to large-scale analysis of high-throughput sequencing (HTS) data. HTS data is typically a result of the protocol designed to address specific research questions. The sequencing results in reads, which when mapped to a reference genome often leads to the formation of distinct patterns (read profiles). Interpretation of these read profiles is essential for their analysis in relation to the research question addressed. Several strategies have been employed at varying levels of abstraction ranging from a somewhat ad hoc to a more systematic analysis of read profiles. These include methods which can compare read profiles, e.g., from direct (non-sequence based) alignments to classification of patterns into functional groups. In this review, we highlight the emerging applications of read profiles for the annotation of non-coding RNA and cis-regulatory elements (CREs) such as enhancers and promoters. We also discuss the biological rationale behind their formation.
    Frontiers in Genetics 05/2015; 6:188. DOI:10.3389/fgene.2015.00188
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The investigation of metabolic regulation at the transcriptional level presents different challenges than those encountered in the study of other important problems like development or cancer. Levels of key components like glucose, insulin, and lipids can be modulated but rarely change in an all-or-none fashion, necessitating quantitative techniques that can be applied to multiple tissues and systems. This review examines recent advances in methods for studying transcriptional regulation, with special emphasis on metabolic science. We compare these methods for investigators trying to decide on the best approach for their particular physiological paradigm or model system.
    Cell metabolism 12/2011; 14(6):739-45. DOI:10.1016/j.cmet.2011.11.007 · 16.75 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: A major prerequisite for the investigation of tissue-specific processes is the identification of cis-regulatory elements. No generally applicable technique is available to distinguish them from any other type of genomic non-coding sequence. Therefore, researchers often have to identify these elements by elaborate in vivo screens, testing individual regions until the right one is found. Here, based on many examples from the literature, we summarize how functional enhancers have been isolated from other elements in the genome and how they have been characterized in transgenic animals. Covering computational and experimental studies, we provide an overview of the global properties of cis-regulatory elements, like their specific interactions with promoters and target gene distances. We describe conserved non-coding elements (CNEs) and their internal structure, nucleotide composition, binding site clustering and overlap, with a special focus on developmental enhancers. Conflicting data and unresolved questions on the nature of these elements are highlighted. Our comprehensive overview of the experimental shortcuts that have been found in the different model organism communities and the new field of high-throughput assays should help during the preparation phase of a screen for enhancers. The review is accompanied by a list of general guidelines for such a project.
    Developmental Biology 02/2011; 350(2):239-54. DOI:10.1016/j.ydbio.2010.11.026 · 3.64 Impact Factor