PIER: protein interface recognition for structural proteomics.

Scripps Research Institute, La Jolla, California 92037, USA.
Proteins Structure Function and Bioinformatics (Impact Factor: 3.34). 06/2007; 67(2):400-17. DOI: 10.1002/prot.21233
Source: PubMed

ABSTRACT Recent advances in structural proteomics call for development of fast and reliable automatic methods for prediction of functional surfaces of proteins with known three-dimensional structure, including binding sites for known and unknown protein partners as well as oligomerization interfaces. Despite significant progress the problem is still far from being solved. Most existing methods rely, at least partially, on evolutionary information from multiple sequence alignments projected on protein surface. The common drawback of such methods is their limited applicability to the proteins with a sparse set of sequential homologs, as well as inability to detect interfaces in evolutionary variable regions. In this study, the authors developed an improved method for predicting interfaces from a single protein structure, which is based on local statistical properties of the protein surface derived at the level of atomic groups. The proposed Protein IntErface Recognition (PIER) method achieved the overall precision of 60% at the recall threshold of 50% at the residue level on a diverse benchmark of 490 homodimeric, 62 heterodimeric, and 196 transient interfaces (compared with 25% precision at 50% recall expected from random residue function assignment). For 70% of proteins in the benchmark, the binding patch residues were successfully detected with precision exceeding 50% at 50% recall. The calculation only took seconds for an average 300-residue protein. The authors demonstrated that adding the evolutionary conservation signal only marginally influenced the overall prediction performance on the benchmark; moreover, for certain classes of proteins, using this signal actually resulted in a deteriorated prediction. Thorough benchmarking using other datasets from literature showed that PIER yielded improved performance as compared with several alignment-free or alignment-dependent predictions. The accuracy, efficiency, and dependence on structure alone make PIER a suitable tool for automated high-throughput annotation of protein structures emerging from structural proteomics projects.

  • [Show abstract] [Hide abstract]
    ABSTRACT: The small GTPase RhoA promotes deregulated signalling upon interaction with Lbc, the oncogenic form of A-kinase anchoring protein (AKAP). The onco-Lbc protein is a hyperactive Rho-specific guanine nucleotide exchange factor (GEF), but its structural mechanism has not been reported, despite its involvement in cardiac hypertrophy and cancer causation. The pleckstrin homology (PH) domain of Lbc is located at the C-terminal end of the protein, and is shown here to specifically recognize activated RhoA rather than lipids. The isolated dbl homology (DH) domain can function as an independent activator with an enhanced activity. However, the DH domain normally does not act as a solitary Lbc interface with RhoA-GDP. Instead it is negatively controlled by the PH domain. In particular the DH helical bundle is coupled to the structurally dependent PH domain through a helical linker, which reduces its activity. Together the two domains form a rigid scaffold in solution, as evidenced by small angle X-ray scattering and 1H,13C,15N-based NMR spectroscopy. The two domains assume a "chair" shape, with its back possessing independent GEF activity, and the PH domain providing a broad seat for RhoA-GTP docking rather than membrane recognition. This provides structural and dynamical insights into how DH and PH domains work together in solution to support regulated RhoA activity. Mutational analysis supports the bifunctional PH domain mediation of DH:RhoA interactions and explains why the tandem domain is required for controlled GEF signaling.
    Journal of Biological Chemistry 07/2014; · 4.60 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The identification of protein-protein interaction sites is a computationally challenging task and importantfor understanding the biology of protein complexes. There is a rich literature in this field. A broadclass of approaches assign to each candidate residue a real-valued score that measures how likely it isthat the residue belongs to the interface. The prediction is obtained by thresholding this score.Some probabilistic models classify the residues on the basis of the posterior probabilities. In thispaper, we introduce pairwise conditional random fields (pCRFs) in which edges are not restrictedto the backbone as in the case of linear-chain CRFs utilized by Li et al. (2007). In fact, any 3Dneighborhoodrelation can be modeled. On grounds of a generalized Viterbi inference algorithm anda piecewise training process for pCRFs, we demonstrate how to utilize pCRFs to enhance a givenresidue-wise score-based protein-protein interface predictor on the surface of the protein under study.The features of the pCRF are solely based on the interface predictions scores of the predictor theperformance of which shall be improved.
    BMC Bioinformatics 08/2014; 15(1):277. · 2.67 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: The function of a protein is determined by its intrinsic activity in the context of its subcellular distribution. Membranes localize proteins within cellular compartments and govern their specific activities. Discovering such membrane-protein interactions is important for understanding biological mechanisms and could uncover novel sites for therapeutic intervention. We present a method for detecting membrane interactive proteins and their exposed residues that insert into lipid bilayers. Although the development process involved analysis of how C1b, C2, ENTH, FYVE, Gla, pleckstrin homology (PH), and PX domains bind membranes, the resulting membrane optimal docking area (MODA) method yields predictions for a given protein of known three-dimensional structures without referring to canonical membrane-targeting modules. This approach was tested on the Arf1 GTPase, ATF2 acetyltransferase, von Willebrand factor A3 domain, and Neisseria gonorrhoeae MsrB protein and further refined with membrane interactive and non-interactive FAPP1 and PKD1 pleckstrin homology domains, respectively. Furthermore we demonstrate how this tool can be used to discover unprecedented membrane binding functions as illustrated by the Bro1 domain of Alix, which was revealed to recognize lysobisphosphatidic acid (LBPA). Validation of novel membrane-protein interactions relies on other techniques such as nuclear magnetic resonance spectroscopy (NMR), which was used here to map the sites of micelle interaction. Together this indicates that genome-wide identification of known and novel membrane interactive proteins and sites is now feasible and provides a new tool for functional annotation of the proteome.
    Biochemistry and Cell Biology 09/2014; 92(6):1-9. · 2.35 Impact Factor


Available from