String Kernels and High-Quality Data Set for Improved Prediction of Kinked Helices in alpha-Helical Membrane Proteins

Johannes Gutenberg-University of Mainz , 55128 Mainz, Germany.
Journal of Chemical Information and Modeling (Impact Factor: 4.07). 11/2011; 51(11):3017-25. DOI: 10.1021/ci200278w
Source: PubMed

ABSTRACT The reasons for distortions from optimal α-helical geometry are widely unknown, but their influences on structural changes of proteins are significant. Hence, their prediction is a crucial problem in structural bioinformatics. For the particular case of kink prediction, we generated a data set of 132 membrane proteins containing 1014 manually labeled helices and examined the environment of kinks. Our sequence analysis confirms the great relevance of proline and reveals disproportionately high occurrences of glycine and serine at kink positions. The structural analysis shows significantly different solvent accessible surface area mean values for kinked and nonkinked helices. More important, we used this data set to validate string kernels for support vector machines as a new kink prediction method. Applying the new predictor, about 80% of all helices could be correctly predicted as kinked or nonkinked even when focusing on small helical fragments. The results exceed recently reported accuracies of alternative approaches and are a consequence of both the method and the data set.

  • [Show abstract] [Hide abstract]
    ABSTRACT: Novel N-heterocyclic carbene based mono and di-nuclear silver probes having the anthracence chromophore act as chemodosimeters for selective and sensitive detection of cyanide ions in the aqueous medium and signal the event through visible enhancement in fluorescence intensity.
    RSC Advances 09/2014; 4(89). DOI:10.1039/C4RA09969A · 3.71 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Kinks are functionally important structural features found in the α-helices of proteins. Structurally, they are points at which a helix abruptly chanαges direction. Current kink definition and identification methods often disagree with one another. Here we describe a crowdsourcing approach to obtain a reliable gold standard set of kinks. Using an online interface, we collected more than 10,000 classifications of 300 helices into straight, curved, or kinked categories. We found that participants were better at discriminating between straight and not-straight helices than between kinked and curved helices. Surprisingly, more obvious kinks were not necessarily identified as more localised within the helix. We present a set of 252 helices where more than 50% of the participants agree on a classification. This set can be used as a reliable gold standard to develop, train and compare computational methods. An interactive visualisation of the results is available online at experiment_results.php.
    Journal of Chemical Information and Modeling 08/2014; 54(9). DOI:10.1021/ci500403a · 4.07 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The class A G-protein-coupled receptors (GPCRs) Orexin-1 (OX1) and Orexin-2 (OX2) are located predominantly in the brain and are linked to a range of different physiological functions, including the control of feeding, energy metabolism, modulation of neuro-endocrine function, and regulation of the sleep–wake cycle. The natural agonists for OX1 and OX2 are two neuropeptides, Orexin-A and Orexin-B, which have activity at both receptors. Site-directed mutagenesis (SDM) has been reported on both the receptors and the peptides and has provided important insight into key features responsible for agonist activity. However, the structural interpretation of how these data are linked together is still lacking. In this work, we produced and used SDM data, homology modeling followed by MD simulation, and ensemble-flexible docking to generate binding poses of the Orexin peptides in the OX receptors to rationalize the SDM data. We also developed a protein pairwise similarity comparing method (ProS) and a GPCR-likeness assessment score (GLAS) to explore the structural data generated within a molecular dynamics simulation and to help distinguish between different GPCR substates. The results demonstrate how these newly developed methods of structural assessment for GPCRs can be used to provide a working model of neuropeptide–Orexin receptor interaction.
    Biochemistry 11/2013; 52(46):8246–8260. DOI:10.1021/bi401119m · 3.19 Impact Factor


Available from
May 19, 2014