Improved validation of peptide MS/MS assignments using spectral intensity prediction.
ABSTRACT A major limitation in identifying peptides from complex mixtures by shotgun proteomics is the ability of search programs to accurately assign peptide sequences using mass spectrometric fragmentation spectra (MS/MS spectra). Manual analysis is used to assess borderline identifications; however, it is error-prone and time-consuming, and criteria for acceptance or rejection are not well defined. Here we report a Manual Analysis Emulator (MAE) program that evaluates results from search programs by implementing two commonly used criteria: 1) consistency of fragment ion intensities with predicted gas phase chemistry and 2) whether a high proportion of the ion intensity (proportion of ion current (PIC)) in the MS/MS spectra can be derived from the peptide sequence. To evaluate chemical plausibility, MAE utilizes similarity (Sim) scoring against theoretical spectra simulated by MassAnalyzer software (Zhang, Z. (2004) Prediction of low-energy collision-induced dissociation spectra of peptides. Anal. Chem. 76, 3908-3922) using known gas phase chemical mechanisms. The results show that Sim scores provide significantly greater discrimination between correct and incorrect search results than achieved by Sequest XCorr scoring or Mascot Mowse scoring, allowing reliable automated validation of borderline cases. To evaluate PIC, MAE simplifies the DTA text files summarizing the MS/MS spectra and applies heuristic rules to classify the fragment ions. MAE output also provides data mining functions, which are illustrated by using PIC to identify spectral chimeras, where two or more peptide ions were sequenced together, as well as cases where fragmentation chemistry is not well predicted.
- SourceAvailable from: rockefeller.edu[show abstract] [hide abstract]
ABSTRACT: A system for creating a library of tandem mass spectra annotated with corresponding peptide sequences was described. This system was based on the annotated spectra currently available in the Global Proteome Machine Database (GPMDB). The library spectra were created by averaging together spectra that were annotated with the same peptide sequence, sequence modifications, and parent ion charge. The library was constructed so that experimental peptide tandem mass spectra could be compared with those in the library, resulting in a peptide sequence identification based on scoring the similarity of the experimental spectrum with the contents of the library. A software implementation that performs this type of library search was constructed and successfully used to obtain sequence identifications. The annotated tandem mass spectrum libraries for the Homo sapiens, Mus musculus, and Saccharomyces cerevisiae proteomes and search software were made available for download and use by other groups.Journal of Proteome Research 09/2006; 5(8):1843-9. · 5.06 Impact Factor
- [show abstract] [hide abstract]
ABSTRACT: Protein identification and peptide sequencing by tandem mass spectrometry requires knowledge of how peptides fragment in the gas phase, specifically which bonds are broken and where the charge(s) resides in the products. For many peptides, cleavage at the amide bonds dominate, producing a series of ions that are designated b and y. For other peptides, enhanced cleavage occurs at just one or two amino acid residues. Surface-induced dissociation, along with gas-phase collision-induced dissociation performed under a variety of conditions, has been used to refine the general 'mobile proton' model and to determine how and why enhanced cleavages occur at aspartic acid residues and protonated histidine residues. Enhanced cleavage at acidic residues occurs when the charge is unavailable to the peptide backbone or the acidic side-chain. The acidic H of the side-chain then serves to initiate cleavage at the amide bond immediately C-terminal to Asp (or Glu), producing an anhydride. In contrast, enhanced cleavage occurs at His when the His side-chain is protonated, turning His into a weak acid that can initiate backbone cleavage by transferring a proton to the backbone. This allows the nucleophilic nitrogen of the His side-chain to attack and form a cyclic structure that is different from the 'typical' backbone cleavage structures.Journal of Mass Spectrometry 01/2001; 35(12):1399-406. · 3.21 Impact Factor
- [show abstract] [hide abstract]
ABSTRACT: Serum protein profiling by surface-enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI-TOF MS) appears to be an important diagnostic tool for a whole range of diseases. Sensitivities and specificities obtained with this new technology often seem superior to those obtained with current biomarkers. However, reproducibility and standardization are still problematic. The present review gives an overview of the diagnostic value of protein profiles obtained with SELDI in studies on prostate and ovarian cancer. To identify aspects important for protein profiling, we compare and discuss differences in pre- and post-analytical conditions presented in the literature supplemented with some of our own data. Further progress in protein profiling as a diagnostic tool requires a more comprehensive description of technical details in all future studies.Clinical Chemistry and Laboratory Medicine 02/2005; 43(12):1281-90. · 3.01 Impact Factor