Exploring proteomes and analyzing protein processing by mass spectrometric identification of sorted N-terminal peptides.

Department of Medical Protein Research, Flanders Interuniversity Institute for Biotechnology, Ghent University, A. Baertsoenkaai 3, B-9000 Ghent, Belgium.
Nature Biotechnology (Impact Factor: 39.08). 06/2003; 21(5):566-9. DOI: 10.1038/nbt810
Source: PubMed

ABSTRACT Current non-gel techniques for analyzing proteomes rely heavily on mass spectrometric analysis of enzymatically digested protein mixtures. Prior to analysis, a highly complex peptide mixture is either separated on a multidimensional chromatographic system or it is first reduced in complexity by isolating sets of representative peptides. Recently, we developed a peptide isolation procedure based on diagonal electrophoresis and diagonal chromatography. We call it combined fractional diagonal chromatography (COFRADIC). In previous experiments, we used COFRADIC to identify more than 800 Escherichia coli proteins by tandem mass spectrometric (MS/MS) analysis of isolated methionine-containing peptides. Here, we describe a diagonal method to isolate N-terminal peptides. This reduces the complexity of the peptide sample, because each protein has one N terminus and is thus represented by only one peptide. In this new procedure, free amino groups in proteins are first blocked by acetylation and then digested with trypsin. After reverse-phase (RP) chromatographic fractionation of the generated peptide mixture, internal peptides are blocked using 2,4,6-trinitrobenzenesulfonic acid (TNBS); they display a strong hydrophobic shift and therefore segregate from the unaltered N-terminal peptides during a second identical separation step. N-terminal peptides can thereby be specifically collected for further liquid chromatography (LC)-MS/MS analysis. Omitting the acetylation step results in the isolation of non-lysine-containing N-terminal peptides from in vivo blocked proteins.

Download full-text


Available from: An Staes, Feb 19, 2015
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Abstract Among the neglected tropical diseases, leishmaniasis is one of the most devastating, resulting in significant mortality and contributing to nearly 2 million disability-adjusted life years. Cutaneous leishmaniasis is a debilitating disorder caused by the kinetoplastid protozoan parasite Leishmania major, which results in disfiguration and scars. L. major genome was the first to be sequenced within the genus Leishmania. Use of proteomic data for annotating genomes is a complementary approach to conventional genome annotation approaches and is referred to as proteogenomics. We have used a proteogenomics-based approach to map the proteome of L. major and also annotate its genome. In this study, we searched L. major promastigote proteomic data against the annotated L. major protein database. Additionally, we searched the proteomic data against six-frame translated L. major genome. In all, we identified 3613 proteins in L. major promastigotes, which covered 43% of its proteome. We also identified 26 genome search-specific peptides, which led to the identification of three novel genes previously not identified in L. major. We also corrected the annotation of N-termini of 15 genes, which resulted in extension of their protein products. We have validated our proteogenomics findings by RT-PCR and sequencing. In addition, our study resulted in identification of 266 N-terminally acetylated peptides in L. major, one of the largest acetylated peptide datasets thus far in Leishmania. This dataset should be a valuable resource to researchers focusing on neglected tropical diseases.
    Omics A Journal of Integrative Biology 06/2014; DOI:10.1089/omi.2013.0159 · 2.73 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Proteolytic cleavages generate active precursor proteins by creating new N-termini in the proteins. A number of strategies have recently been published regarding the enrichment of original or newly formed N-terminal peptides using guanidination of lysine residues and amine-reactive reagents. For effective enrichment of N-terminal peptides, the efficiency of trypsin proteolysis on homoarginine (guanidinated) modified proteins must be understood and simple and versatile solid-phase N-terminal capture strategies should be developed. We present here a mass spectrometry (MS)-based study to evaluate and optimize the trypsin proteolysis on a guanidinated-modified protein. Trypsin proteolysis was studied using different amounts of trypsin to modified protein ratios. To capture the original N-termini, after guanidination of proteins, original N-termini were acetylated and the proteins were digested with trypsin. The newly formed N-terminal tryptic peptides were captured with a new amine reactive acid-cleavable solid-phase reagent. The original N-terminal peptides were then collected from the supernatant of the solution. We demonstrated a detailed study of the efficiency of enzyme trypsin on homoarginine-modified proteins. We observed that the rate of hydrolysis of homoarginine residues compared to their lysine/arginine counterparts were slower but generally cleaved after an overnight digestion period depending on the protein to protease concentration ratios. Selectivity of the solid-phase N-terminal reagent was studied by enrichment of original N-terminal peptides from two standard proteins, ubiquitin and RNaseS. We found enzyme trypsin is active in the guanidinated form of the protein depending on the enzyme to protein concentrations, time and the proximity of arginine residues in the sequence. The novel solid-phase capture reagent also successfully enriched N-terminal peptides from the standard protein mixtures. We believe this trypsin proteolysis study on homoarginine-modified proteins and our simple and versatile solid-phase capture strategy could be very useful for enrichment and sequence determination of proteins N-termini by MS. Copyright © 2014 John Wiley & Sons, Ltd.
    Rapid Communications in Mass Spectrometry 03/2014; 28(6):635-44. DOI:10.1002/rcm.6820 · 2.64 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The secretopeptidome comprises endogenous peptides derived from proteins secreted into the tumour microenvironment through classical and non-classical secretion. This study characterised the low-Mr (<3kDa) component of the human colon tumour (LIM1215, LIM1863) secretopeptidome, as a first step towards gaining insights into extracellular proteolytic cleavage events in the tumour microenvironment. Based on two biological replicates, this secretopeptidome isolation strategy utilised differential centrifugal ultrafiltration in combination with analytical RP-HPLC and nanoLC-MS/MS. Secreted peptides were identified using a combination of Mascot and post-processing analyses including MSPro re-scoring, extended feature sets and Percolator, resulting in 474 protein identifications from 1228 peptides (≤1% q-value, ≤5% PEP) - a 36% increase in peptide identifications when compared with conventional Mascot (homology ionscore thresholding). In both colon tumour models, 122 identified peptides were derived from 41 cell surface protein ectodomains, 23 peptides (12 proteins) from regulated intramembrane proteolysis (RIP), and 12 peptides (9 proteins) generated from intracellular domain proteolysis. Further analyses using the protease/substrate database MEROPS (, revealed 335 (71%) proteins classified as originating from classical/non-classical secretion, or the cell membrane. Of these, peptides were identified from 42 substrates in MEROPS with defined protease cleavage sites, while peptides generated from a further 205 substrates were fragmented by hitherto unknown proteases. A salient finding was the identification of peptides from 88 classical/ non-classical secreted substrates in MEROPS, implicated in tumour progression and angiogenesis (FGFBP1, PLXDC2), cell-cell recognition and signalling (DDR1, GPA33), and tumour invasiveness and metastasis (MACC1, SMAGP); the nature of the proteases responsible for these proteolytic events is unknown. To confirm reproducibility of peptide fragment abundance in this study, we report the identification of a specific cleaved peptide fragment in the secretopeptidome from the colon-specific GPA33 antigen in 4/14 human CRC models. This improved secretopeptidome isolation and characterisation strategy has extended our understanding of endogenous peptides generated through proteolysis of classical/ non-classical secreted proteins, extracellular proteolytic processing of cell surface membrane proteins, and peptides generated through RIP. The novel peptide cleavage site information in this study provides a useful first step in detailing proteolytic cleavage associated with tumourigenesis and the extracellular environment. This article is part of a Special Issue entitled: An Updated Secretome.
    Biochimica et Biophysica Acta 05/2013; DOI:10.1016/j.bbapap.2013.05.006 · 4.66 Impact Factor