Prediction of high-responding peptides for targeted protein assays by mass spectrometry.

Broad Institute of Massachusetts Institute of Technology and Harvard, 7 Cambridge Center, Cambridge, Massachusetts 02142, USA.
Nature Biotechnology (Impact Factor: 39.08). 02/2009; 27(2):190-8. DOI: 10.1038/nbt.1524
Source: PubMed

ABSTRACT Protein biomarker discovery produces lengthy lists of candidates that must subsequently be verified in blood or other accessible biofluids. Use of targeted mass spectrometry (MS) to verify disease- or therapy-related changes in protein levels requires the selection of peptides that are quantifiable surrogates for proteins of interest. Peptides that produce the highest ion-current response (high-responding peptides) are likely to provide the best detection sensitivity. Identification of the most effective signature peptides, particularly in the absence of experimental data, remains a major resource constraint in developing targeted MS-based assays. Here we describe a computational method that uses protein physicochemical properties to select high-responding peptides and demonstrate its utility in identifying signature peptides in plasma, a complex proteome with a wide range of protein concentrations. Our method, which employs a Random Forest classifier, facilitates the development of targeted MS-based assays for biomarker verification or any application where protein levels need to be measured.

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The search for clinically useful protein biomarkers using advanced mass spectrometry approaches represents a major focus in cancer research. However, the direct analysis of human samples may be challenging due to limited availability, the absence of appropriate control samples, or the large background variability observed in patient material. As an alternative approach, human tumors orthotopically implanted into a different species (xenografts) are clinically relevant models that have proven their utility in pre-clinical research. Patient derived xenografts for glioblastoma have been extensively characterized in our laboratory and have been shown to retain the characteristics of the parental tumor at the phenotypic and genetic level. Such models were also found to adequately mimic the behavior and treatment response of human tumors. The reproducibility of such xenograft models, the possibility to identify their host background and perform tumor-host interaction studies, are major advantages over the direct analysis of human samples. At the proteome level, the analysis of xenograft samples is challenged by the presence of proteins from two different species which, depending on tumor size, type or location, often appear at variable ratios. Any proteomics approach aimed at quantifying proteins within such samples must consider the identification of species specific peptides in order to avoid biases introduced by the host proteome. Here, we present an in-house methodology and tool developed to select peptides used as surrogates for protein candidates from a defined proteome (e.g., human) in a host proteome background (e.g., mouse, rat) suited for a mass spectrometry analysis. The tools presented here are applicable to any species specific proteome, provided a protein database is available. By linking the information from both proteomes, PeptideManager significantly facilitates and expedites the selection of peptides used as surrogates to analyze proteins of interest.
    Frontiers in Genetics 09/2014; 5. DOI:10.3389/fgene.2014.00305
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Peptide quantification using MS often relies on the comparison of peptide signal intensities between different samples, which is based on the assumption that observed signal intensity has a linear relationship to peptide abundance. A typical proteomics experiment is subject to multiple sources of variance, so we focussed here on properties affecting peptide linearity under simple, well-defined conditions. Peptides from a standard protein digest were analysed by multiple reaction monitoring (MRM) MS to determine peptide linearity over a range of concentrations. We show that many peptides do not display a linear relationship between signal intensity and amount under standard conditions. Increasing the organic content of the sample solvent increased peptide linearity by increasing the accuracy and precision of quantification, which suggests that peptide non-linearity is due to concentration-dependent surface adsorption. Using multiple peptides at various dilutions, we show that peptide non-linearity is related to observed retention time and predicted hydrophobicity. Whereas the effect of adsorption on peptide storage has been investigated previously, here we demonstrate the deleterious effect of peptide adsorption on the quantification of fresh samples, highlight aspects of sample preparation that can minimise the effect, and suggest bioinformatic approaches to enhance the selection of peptides for quantification.
    Journal of proteomics 05/2013; 85(100). DOI:10.1016/j.jprot.2013.04.034 · 3.93 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Assembling peptides identified from tandem mass spectra into a list of proteins, referred to as protein inference, is a critical step in proteomics research. Due to the existence of degenerate peptides and 'one-hit wonders', it is very difficult to determine which proteins are present in the sample. In this paper, we review existing protein inference methods and classify them according to the source of peptide identifications and the principle of algorithms. It is hoped that the readers will gain a good understanding of the current development in this field after reading this review and come up with new protein inference algorithms.
    Briefings in Bioinformatics 02/2012; 13(5):586-614. DOI:10.1093/bib/bbs004 · 5.92 Impact Factor


Available from