Recommendations for Mass Spectrometry Data Quality Metrics for Open Access Data (Corollary to the Amsterdam Principles)

Johns Hopkins University, Baltimore, Maryland, United States
PROTEOMICS - CLINICAL APPLICATIONS (Impact Factor: 2.96). 12/2011; 5(11-12):580-9. DOI: 10.1002/prca.201100097
Source: PubMed


Policies supporting the rapid and open sharing of proteomic data are being implemented by the leading journals in the field. The proteomics community is taking steps to ensure that data are made publicly accessible and are of high quality, a challenging task that requires the development and deployment of methods for measuring and documenting data quality metrics. On September 18, 2010, the U.S. National Cancer Institute (NCI) convened the "International Workshop on Proteomic Data Quality Metrics" in Sydney, Australia, to identify and address issues facing the development and use of such methods for open access proteomics data. The stakeholders at the workshop enumerated the key principles underlying a framework for data quality assessment in mass spectrometry data that will meet the needs of the research community, journals, funding agencies, and data repositories. Attendees discussed and agreed upon two primary needs for the wide use of quality metrics: (i) an evolving list of comprehensive quality metrics and (ii) standards accompanied by software analytics. Attendees stressed the importance of increased education and training programs to promote reliable protocols in proteomics. This workshop report explores the historic precedents, key discussions, and necessary next steps to enhance the quality of open access data. By agreement, this article is published simultaneously in Proteomics, Proteomics Clinical Applications, Journal of Proteome Research, and Molecular and Cellular Proteomics, as a public service to the research community. The peer review process was a coordinated effort conducted by a panel of referees selected by the journals.



Available from: David L Tabb
  • Source
    • "In 2009, the journal Molecular and Cellular Proteomics mandated public access to raw instrument data as a requirement for publication [20]; this requirement, however, has been held in abeyance due to stability challenges in one of the most widely-used proteomics repositories. A 2010 NCI workshop at the Human Proteomics Organization conference in Sydney began grappling with the challenge of assessing the quality of data sets held in public repositories [21]. If data are stored in proprietary file formats, their public availability may not be sufficient to allow subsequent re-use."
    ABSTRACT: Proteomics has emerged from the labs of technologists to enter widespread application in clinical contexts. This transition, however, has been hindered by overstated early claims of accuracy, concerns about reproducibility, and the challenges of handling batch effects properly. New efforts have produced sets of performance metrics and measurements of variability that establish sound expectations for experiments in clinical proteomics. As researchers begin incorporating these metrics in a quality by design paradigm, the variability of individual steps in experimental pipelines will be reduced, regularizing overall outcomes. This review discusses the evolution of quality assessment in 2D gel electrophoresis, mass spectrometry-based proteomic profiling, tandem mass spectrometry-based protein inventories, and proteomic quantitation. Taken together, the advances in each of these technologies are establishing databases that will be increasingly useful for decision-making in clinical experimentation.
    Full-text · Article · Dec 2012 · Clinical biochemistry
  • Source
    ABSTRACT: We report the release of mzIdentML, an exchange standard for peptide and protein identification data, designed by the Proteomics Standards Initiative. The format was developed by the Proteomics Standards Initiative in collaboration with instrument and software vendors, and the developers of the major open-source projects in proteomics. Software implementations have been developed to enable conversion from most popular proprietary and open-source formats, and mzIdentML will soon be supported by the major public repositories. These developments enable proteomics scientists to start working with the standard for exchanging and publishing data sets in support of publications and they provide a stable platform for bioinformatics groups and commercial software vendors to work with a single file format for identification data.
    Full-text · Article · Feb 2012 · Molecular & Cellular Proteomics
  • ABSTRACT: Selected reaction monitoring (SRM) is an accurate quantitative technique, typically used for small-molecule mass spectrometry (MS). SRM has emerged as an important technique for targeted and hypothesis-driven proteomic research, and is becoming the reference method for protein quantification in complex biological samples. SRM offers high selectivity, a lower limit of detection and improved reproducibility, compared to conventional shot-gun-based tandem MS (LC-MS/MS) methods. Unlike LC-MS/MS, which requires computationally intensive informatic postanalysis, SRM requires preacquisition bioinformatic analysis to determine proteotypic peptides and optimal transitions to uniquely identify and to accurately quantitate proteins of interest. Extensive arrays of bioinformatics software tools, both web-based and stand-alone, have been published to assist researchers to determine optimal peptides and transition sets. The transitions are oftentimes selected based on preferred precursor charge state, peptide molecular weight, hydrophobicity, fragmentation pattern at a given collision energy (CE), and instrumentation chosen. Validation of the selected transitions for each peptide is critical since peptide performance varies depending on the mass spectrometer used. In this review, we provide an overview of open source and commercial bioinformatic tools for analyzing LC-MS data acquired by SRM.
    No preview · Article · Apr 2012 · Proteomics