Factor analysis scales of generalized amino acid information as applied in predicting interactions between the human amphiphysin-1 SH3 domains and their peptide ligands.

College of Bioengineering, Chongqing University, Chongqing 400030, PR China.
Chemical Biology &amp Drug Design (Impact Factor: 2.47). 05/2008; 71(4):345-51. DOI: 10.1111/j.1747-0285.2008.00641.x
Source: PubMed

ABSTRACT Factor analysis scales of generalized amino acid information (FASGAI) involving hydrophobicity, alpha-helix and beta-turn propensities, bulky properties, compositional characteristics, local flexibility, and electronic properties, was proposed to represent the structures of the decapeptides binding the human amphiphysin-1 SH3 domains. Parameters being responsible for the binding affinities were selected by genetic algorithm, and a quantitative structure-affinity relationship (QSAR) model by partial least square was established to predict the peptide-SH3 domain interactions. Diversified properties of the residues between P(2) and P(-3) (including P(2) and P(-3)) of the decapeptide (P(4)P(3)P(2)P(1)P(0)P(-1)P(-2)P(-3)P(-4)P(-5)) may contribute remarkable effect to the interactions between the SH3 domain and the decapeptide. Particularly, electronic properties of P(2) may provide relatively large positive contributions to the interactions, and reversely, hydrophobicity of P(2) may be largely negative to the interactions. These results showed that FASGAI vectors can well represent the structural characteristics of the decapeptides. Furthermore, the model obtained, which showed low computational complexity, correlated FASGAI descriptors with the binding affinities as well as that FASGAI vectors may also be applied in QSAR studies of peptides.

  • [Show abstract] [Hide abstract]
    ABSTRACT: Peptides play significant roles in the biological world. To optimize activity for a specific therapeutic target, peptide library synthesis is inevitable; which is a time consuming and expensive. Computational approaches provide a promising way to simply elucidate the structural basis in the design of new peptides. Earlier, we proposed a novel methodology termed HomoSAR to gain insight into the structure activity relationships underlying peptides. Based on an integrated approach, HomoSAR uses the principles of homology modeling in conjunction with the quantitative structural activity relationship formalism to predict and design new peptide sequences with the optimum activity. In the present study, we establish that the HomoSAR methodology can be universally applied to all classes of peptides irrespective of sequence length by studying HomoSAR on three peptide datasets viz., angiotensin-converting enzyme inhibitory peptides, CAMEL-s antibiotic peptides, and hAmphiphysin-1 SH3 domain binding peptides, using a set of descriptors related to the hydrophobic, steric, and electronic properties of the 20 natural amino acids. Models generated for all three datasets have statistically significant correlation coefficients (r(2) ) and predictive r2 (rpred 2) and cross validated coefficient ( qLOO 2). The daintiness of this technique lies in its simplicity and ability to extract all the information contained in the peptides to elucidate the underlying structure activity relationships. The difficulties of correlating both sequence diversity and variation in length of the peptides with their biological activity can be addressed. The study has been able to identify the preferred or detrimental nature of amino acids at specific positions in the peptide sequences. © 2013 Wiley Periodicals, Inc.
    Journal of Computational Chemistry 09/2013; · 3.84 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Domain-peptide recognition and interaction are fundamentally important for eukaryotic signaling and regulatory networks. It is thus essential to quantitatively infer the binding stability and specificity of such interaction based upon large-scale but low-accurate complex structure models which could be readily obtained from sophisticated molecular modeling procedure. In the present study, a new method is described for the fast and reliable prediction of domain-peptide binding affinity with coarse-grained structure models. This method is designed to tolerate strong random noises involved in domain-peptide complex structures and uses statistical modeling approach to eliminate systematic bias associated with a group of investigated samples. As a paradigm, this method was employed to model and predict the binding behavior of various peptides to four evolutionarily unrelated peptide-recognition domains (PRDs), i.e. human amph SH3, human nherf PDZ, yeast syh GYF and yeast bmh 14-3-3, and moreover, we explored the molecular mechanism and biological implication underlying the binding of cognate and noncognate peptide ligands to their domain receptors. It is expected that the newly proposed method could be further used to perform genome-wide inference of domain-peptide binding at three-dimensional structure level.
    Bio Systems 05/2013; · 1.27 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Peptide-mediated interactions are crucial to a variety of functions in the living cell and are estimated to be involved in up to 40 % of all cellular processes. Fast and reliable inference of such interactions is fundamentally important for our understanding and, then, reconstruction of complete virtual interactomics involved in a specific cell, tissue or organism. In the current study, we performed structure-level characterization, modeling and prediction of protein-peptide recognition specificity and stability in a high-throughput manner. To achieve this, the classical chemometrics methodology quantitative structure-activity relationship (QSAR), which is traditionally applied to small-molecule entities such as drug compounds and environmental chemicals, was employed to statistically correlate structure features with binding affinities for a panel of structure-solved, affinity-known protein-peptide complexes compiled from the PDB database and literatures. In the standard QSAR procedure, various structural descriptors including physicochemical, geometrical and constitutional parameters that characterize diverse aspects of protein-peptide interaction property were derived from the biomacromolecular complex structure architecture, and these descriptors were then correlated with experimentally measured affinities by using the partial least squares (PLS) regression and Gaussian process (GP) in conjunction with genetic algorithm (GA) variable selection. The nonlinear GA/GP method was found to perform much well as compared to linear GA/PLS modeling, suggesting that the protein-peptide interaction system is highly complicated that may involve strong noise and interactive effect. The optimal GA/GP model revealed that the interface size and solvent effect play a critical role in protein-peptide binding, and other properties such as peptide length and flexibility also contribute significantly to the binding. A further test on 2,018 human amphiphysin SH3 domain-binding peptides demonstrated that the purposed QSAR modeling procedure is very fast and effective, which can thus be readily used to perform proteome-wide inference of peptide-mediated interactions.
    The Protein Journal 10/2013; 32(7):568-78. · 1.13 Impact Factor