String Kernels and High-Quality Data Set for Improved Prediction of Kinked Helices in α-Helical Membrane Proteins

Johannes Gutenberg-University of Mainz , 55128 Mainz, Germany.
Journal of Chemical Information and Modeling (Impact Factor: 3.74). 11/2011; 51(11):3017-25. DOI: 10.1021/ci200278w
Source: PubMed


The reasons for distortions from optimal α-helical geometry are widely unknown, but their influences on structural changes of proteins are significant. Hence, their prediction is a crucial problem in structural bioinformatics. For the particular case of kink prediction, we generated a data set of 132 membrane proteins containing 1014 manually labeled helices and examined the environment of kinks. Our sequence analysis confirms the great relevance of proline and reveals disproportionately high occurrences of glycine and serine at kink positions. The structural analysis shows significantly different solvent accessible surface area mean values for kinked and nonkinked helices. More important, we used this data set to validate string kernels for support vector machines as a new kink prediction method. Applying the new predictor, about 80% of all helices could be correctly predicted as kinked or nonkinked even when focusing on small helical fragments. The results exceed recently reported accuracies of alternative approaches and are a consequence of both the method and the data set.

Download full-text


Available from: Benny Kneissl, Mar 24, 2014
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: We have carried out statistical analyses and computer simulations of helical kinks for TM helices in the PDBTM database. About 59 % of 1562 TM helices showed a significant kink, and 38 % of these kinks are associated with prolines in a range of ±4 residues. Our analyses show that helical kinks are more populated in the central region of helices, particularly in the range of 1-3 residues away from the helix center. Among 1,053 helical kinks analyzed, 88 % of kinks are bends (change in helix axis without loss of helical character) and 12 % are disruptions (change in helix axis and loss of helical character). It is found that proline residues tend to cause larger kink angles in helical bends, while this effect is not observed in helical disruptions. A further analysis of these kinked helices suggests that a kinked helix usually has 1-2 broken backbone hydrogen bonds with the corresponding N-O distance in the range of 4.2-8.7 Å, whose distribution is sharply peaked at 4.9 Å followed by an exponential decay with increasing distance. Our main aims of this study are to understand the formation of helical kinks and to predict their structural features. Therefore we further performed molecular dynamics (MD) simulations under four simulation scenarios to investigate kink formation in 37 kinked TM helices and 5 unkinked TM helices. The representative models of these kinked helices are predicted by a clustering algorithm, SPICKER, from numerous decoy structures possessing the above generic features of kinked helices. Our results show an accuracy of 95 % in predicting the kink position of kinked TM helices and an error less than 10° in the angle prediction of 71.4 % kinked helices. For unkinked helices, based on various structure similarity tests, our predicted models are highly consistent with their crystal structure. These results provide strong supports for the validity of our method in predicting the structure of TM helices.
    Full-text · Article · Sep 2012 · Journal of Computer-Aided Molecular Design
  • [Show abstract] [Hide abstract]
    ABSTRACT: Kinks have been observed to provide important functional and structural features for membrane proteins. Despite their ubiquity in membrane proteins, and their perceived importance, no protein modelling methods explicitly considers kinks. In spite of the limited data for transmembrane proteins, we were able to develop a knowledge-based modelling method for introducing kinks, which we demonstrate can be exploited in modelling approaches to improve the quality of models. The work entailed a thorough analysis of the available high resolution membrane protein structures, concomitantly demonstrating the complexity of the structural considerations for kink prediction. Furthermore, our results indicate that there are systematic and significant differences in the sequence as well as the structural environment between kinked and non-kinked transmembrane helices. To the best of our knowledge we are reporting a method for modeling kinks for the first time.
    No preview · Article · Oct 2013 · Journal of Chemical Information and Modeling
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The class A G-protein-coupled receptors (GPCRs) Orexin-1 (OX1) and Orexin-2 (OX2) are located predominantly in the brain and are linked to a range of different physiological functions, including the control of feeding, energy metabolism, modulation of neuro-endocrine function, and regulation of the sleep–wake cycle. The natural agonists for OX1 and OX2 are two neuropeptides, Orexin-A and Orexin-B, which have activity at both receptors. Site-directed mutagenesis (SDM) has been reported on both the receptors and the peptides and has provided important insight into key features responsible for agonist activity. However, the structural interpretation of how these data are linked together is still lacking. In this work, we produced and used SDM data, homology modeling followed by MD simulation, and ensemble-flexible docking to generate binding poses of the Orexin peptides in the OX receptors to rationalize the SDM data. We also developed a protein pairwise similarity comparing method (ProS) and a GPCR-likeness assessment score (GLAS) to explore the structural data generated within a molecular dynamics simulation and to help distinguish between different GPCR substates. The results demonstrate how these newly developed methods of structural assessment for GPCRs can be used to provide a working model of neuropeptide–Orexin receptor interaction.
    Preview · Article · Nov 2013 · Biochemistry
Show more