Publications (95) View all
-
Article: Evidence for a persistent microbial seed bank throughout the global ocean.
[show abstract] [hide abstract]
ABSTRACT: Do bacterial taxa demonstrate clear endemism, like macroorganisms, or can one site's bacterial community recapture the total phylogenetic diversity of the world's oceans? Here we compare a deep bacterial community characterization from one site in the English Channel (L4-DeepSeq) with 356 datasets from the International Census of Marine Microbes (ICoMM) taken from around the globe (ranging from marine pelagic and sediment samples to sponge-associated environments). At the L4-DeepSeq site, increasing sequencing depth uncovers greater phylogenetic overlap with the global ICoMM data. This site contained 31.7-66.2% of operational taxonomic units identified in a given ICoMM biome. Extrapolation of this overlap suggests that 1.93 × 10(11) sequences from the L4 site would capture all ICoMM bacterial phylogenetic diversity. Current technology trends suggest this limit may be attainable within 3 y. These results strongly suggest the marine biosphere maintains a previously undetected, persistent microbial seed bank.Proceedings of the National Academy of Sciences 03/2013; · 9.68 Impact Factor -
Article: Meeting Report: Hackathon-Workshop on Darwin Core and MIxS Standards Alignment (February 2012).
Eamonn Ó Tuama, John Deck, Gabriel Dröge, Markus Döring, Dawn Field, Renzo Kottmann, Juncai Ma, Hiroshi Mori, Norman Morrison, Peter Sterk, Hideaki Sugawara, John Wieczorek, Linhuan Wu, Pelin Yilmaz[show abstract] [hide abstract]
ABSTRACT: The Global Biodiversity Information Facility and the Genomic Standards Consortium convened a joint workshop at the University of Oxford, 27-29 February 2012, with a small group of experts from Europe, USA, China and Japan, to continue the alignment of the Darwin Core with the MIxS and related genomics standards. Several reference mappings were produced as well as test expressions of MIxS in RDF. The use and management of controlled vocabulary terms was considered in relation to both GBIF and the GSC, and tools for working with terms were reviewed. Extensions for publishing genomic biodiversity data to the GBIF network via a Darwin Core Archive were prototyped and work begun on preparing translations of the Darwin Core to Japanese and Chinese. Five genomic repositories were identified for engagement to begin the process of testing the publishing of genomic data to the GBIF network commencing with the SILVA rRNA database.Standards in Genomic Sciences 10/2012; 7(1):166-70. · 1.62 Impact Factor -
SourceAvailable from: Robert John Robbins
Article: RCN4GSC Workshop Report: Managing Data at the Interface of Biodiversity and (Meta)Genomics, March 2011.
Robert J Robbins, Linda Amaral-Zettler, Holly Bik, Stan Blum, James Edwards, Dawn Field, George Garrity, Jack A Gilbert, Renzo Kottmann, Leonard Krishtalka, Hilmar Lapp, Carolyn Lawrence, Norman Morrison, Eamonn Ó Tuama, Cynthia Parr, Inigo San Gil, David Schindel, Lynn Schriml, David Vieglas, John Wooley[show abstract] [hide abstract]
ABSTRACT: Building on the planning efforts of the RCN4GSC project, a workshop was convened in San Diego to bring together experts from genomics and metagenomics, biodiversity, ecology, and bioinformatics with the charge to identify potential for positive interactions and progress, especially building on successes at establishing data standards by the GSC and by the biodiversity and ecological communities. Until recently, the contribution of microbial life to the biomass and biodiversity of the biosphere was largely overlooked (because it was resistant to systematic study). Now, emerging genomic and metagenomic tools are making investigation possible. Initial research findings suggest that major advances are in the offing. Although different research communities share some overlapping concepts and traditions, they differ significantly in sampling approaches, vocabularies and workflows. Likewise, their definitions of 'fitness for use' for data differ significantly, as this concept stems from the specific research questions of most importance in the different fields. Nevertheless, there is little doubt that there is much to be gained from greater coordination and integration. As a first step toward interoperability of the information systems used by the different communities, participants agreed to conduct a case study on two of the leading data standards from the two formerly disparate fields: (a) GSC's standard checklists for genomics and metagenomics and (b) TDWG's Darwin Core standard, used primarily in taxonomy and systematic biology.Standards in Genomic Sciences 10/2012; 7(1):159-65. · 1.62 Impact Factor -
Article: The Metadata Coverage Index (MCI): A standardized metric for quantifying database metadata richness.
Konstantinos Liolios, Lynn Schriml, Lynette Hirschman, Ioanna Pagani, Bahador Nosrat, Peter Sterk, Owen White, Philippe Rocca-Serra, Susanna-Assunta Sansone, Chris Taylor, Nikos C Kyrpides, Dawn Field[show abstract] [hide abstract]
ABSTRACT: Variability in the extent of the descriptions of data ('metadata') held in public repositories forces users to assess the quality of records individually, which rapidly becomes impractical. The scoring of records on the richness of their description provides a simple, objective proxy measure for quality that enables filtering that supports downstream analysis. Pivotally, such descriptions should spur on improvements. Here, we introduce such a measure - the 'Metadata Coverage Index' (MCI): the percentage of available fields actually filled in a record or description. MCI scores can be calculated across a database, for individual records or for their component parts (e.g., fields of interest). There are many potential uses for this simple metric: for example; to filter, rank or search for records; to assess the metadata availability of an ad hoc collection; to determine the frequency with which fields in a particular record type are filled, especially with respect to standards compliance; to assess the utility of specific tools and resources, and of data capture practice more generally; to prioritize records for further curation; to serve as performance metrics of funded projects; or to quantify the value added by curation. Here we demonstrate the utility of MCI scores using metadata from the Genomes Online Database (GOLD), including records compliant with the 'Minimum Information about a Genome Sequence' (MIGS) standard developed by the Genomic Standards Consortium. We discuss challenges and address the further application of MCI scores; to show improvements in annotation quality over time, to inform the work of standards bodies and repository providers on the usability and popularity of their products, and to assess and credit the work of curators. Such an index provides a step towards putting metadata capture practices and in the future, standards compliance, into a quantitative and objective framework.Standards in Genomic Sciences 07/2012; 6(3):438-47. · 1.62 Impact Factor -
SourceAvailable from: Folker Meyer
Article: Report of the 13(th) Genomic Standards Consortium Meeting, Shenzhen, China, March 4-7, 2012.
Jack A Gilbert, Yiming Bao, Hui Wang, Susanna-Assunta Sansone, Scott C Edmunds, Norman Morrison, Folker Meyer, Lynn M Schriml, Neil Davies, Peter Sterk, Jared Wilkening, George M Garrity, Dawn Field, Robert Robbins, Daniel P Smith, Ilene Mizrachi, Corrie Moreau[show abstract] [hide abstract]
ABSTRACT: This report details the outcome of the 13(th) Meeting of the Genomic Standards Consortium. The three-day conference was held at the Kingkey Palace Hotel, Shenzhen, China, on March 5-7, 2012, and was hosted by the Beijing Genomics Institute. The meeting, titled From Genomes to Interactions to Communities to Models, highlighted the role of data standards associated with genomic, metagenomic, and amplicon sequence data and the contextual information associated with the sample. To this end the meeting focused on genomic projects for animals, plants, fungi, and viruses; metagenomic studies in host-microbe interactions; and the dynamics of microbial communities. In addition, the meeting hosted a Genomic Observatories Network session, a Genomic Standards Consortium biodiversity working group session, and a Microbiology of the Built Environment session sponsored by the Alfred P. Sloan Foundation.Standards in Genomic Sciences 05/2012; 6(2):276-86. · 1.62 Impact Factor