modMine: flexible access to modENCODE data

Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, UK.
Nucleic Acids Research (Impact Factor: 9.11). 11/2011; 40(Database issue):D1082-8. DOI: 10.1093/nar/gkr921
Source: PubMed

ABSTRACT In an effort to comprehensively characterize the functional elements within the genomes of the important model organisms Drosophila melanogaster and Caenorhabditis elegans, the NHGRI model organism Encyclopaedia of DNA Elements (modENCODE) consortium has generated an enormous library of genomic data along with detailed, structured information on all aspects of the experiments. The modMine database described here has been built by the modENCODE Data Coordination Center to allow the broader research community to (i) search for and download data sets of interest among the thousands generated by modENCODE; (ii) access the data in an integrated form together with non-modENCODE data sets; and (iii) facilitate fine-grained analysis of the above data. The sophisticated search features are possible because of the collection of extensive experimental metadata by the consortium. Interfaces are provided to allow both biologists and bioinformaticians to exploit these rich modENCODE data sets now available via modMine.

Available from: Rachel Lyne, Aug 04, 2015
  • Source
    "However, the locations and extents of UTRs, promoters, and enhancers are more difficult to annotate. For mammalian genes, the TRANSFAC database (Wingender et al. 1996) reports experimentally validated transcription factor binding sites, consensus binding sequences, etc. The availability of various relevant data from modENCODE (Contrino et al. 2012) should have a positive impact on annotation of regulatory regions in the future. Moreover, new databases relevant to fly transcriptomics are already emerging, e.g., OnTheFly (Shazman et al. 2013) and REDfly (Gallo et al. 2011). Additionally, although there is a wealth of information about signaling and biochemical"
    ABSTRACT: Drosophila melanogaster has become a system-of-choice for functional genomic studies. Many resources, including online databases and software tools, are now available to support design or identification of relevant fly stocks and reagents, or analysis and mining of existing functional genomic, transcriptomic, proteomic, etc. datasets. These include large community collections of fly stocks and plasmid clones, 'meta' information sites like FlyBase and FlyMine, and an increasing number of more specialized reagents, databases and online tools. Here, we introduce key resources useful to plan large-scale functional genomics studies in Drosophila and to analyze, integrate and mine the results of those studies in ways that facilitate identification of highest-confidence results and generation of new hypotheses. We also discuss ways in which existing resources can be used and might be improved, and suggest a few areas of future development that would further support large- and small-scale studies in Drosophila and facilitate use of Drosophila information by the research community more generally.
    Genetics 03/2014; 197(1). DOI:10.1534/genetics.113.154344 · 4.87 Impact Factor
  • ABSTRACT: All cells in a multicellular organism contain the same genome, yet different cell types express different sets of genes. Recent advances in high throughput genomic technologies have opened up new opportunities to understand the gene regulatory network in diverse cell types in a genome-wide manner. Here, I discuss recent advances in experimental and computational approaches for the study of gene regulation in embryonic development from a systems perspective. This review is written for computational biologists who have an interest in studying developmental gene regulation through integrative analysis of gene expression, chromatin landscape, and signaling pathways. I highlight the utility of publicly available data and tools, as well as some common analysis approaches.
    Biophysical Reviews 09/2012; 4(3). DOI:10.1007/s12551-012-0092-9
  • Source
    ABSTRACT: The regulation of transcription of eukaryotic genes is a very complex process, which involves interactions between transcription factors (TFs) and DNA, as well as other epigenetic factors like histone modifications, DNA methylation, and so on, which nowadays can be studied and characterized with techniques like ChIP-Seq. Cscan is a web resource that includes a large collection of genome-wide ChIP-Seq experiments performed on TFs, histone modifications, RNA polymerases and others. Enriched peak regions from the ChIP-Seq experiments are crossed with the genomic coordinates of a set of input genes, to identify which of the experiments present a statistically significant number of peaks within the input genes' loci. The input can be a cluster of co-expressed genes, or any other set of genes sharing a common regulatory profile. Users can thus single out which TFs are likely to be common regulators of the genes, and their respective correlations. Also, by examining results on promoter activation, transcription, histone modifications, polymerase binding and so on, users can investigate the effect of the TFs (activation or repression of transcription) as well as of the cell or tissue specificity of the genes' regulation and expression. The web interface is free for use, and there is no login requirement. Available at:
    Nucleic Acids Research 06/2012; 40(Web Server issue):W510-5. DOI:10.1093/nar/gks483 · 9.11 Impact Factor