Large-scale mapping of human protein interactome using structural complexes

National Center for Biotechnology Information, US National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, Maryland 20894, USA.
EMBO Reports (Impact Factor: 9.06). 03/2012; 13(3):266-71. DOI: 10.1038/embor.2011.261
Source: PubMed


Although the identification of protein interactions by high-throughput (HTP) methods progresses at a fast pace, 'interactome' data sets still suffer from high rates of false positives and low coverage. To map the human protein interactome, we describe a new framework that uses experimental evidence on structural complexes, the atomic details of binding interfaces and evolutionary conservation. The structurally inferred interaction network is highly modular and more functionally coherent compared with experimental interaction networks derived from multiple literature citations. Moreover, structurally inferred and high-confidence HTP networks complement each other well, allowing us to construct a merged network to generate testable hypotheses and provide valuable experimental leads.

Download full-text


Available from: Anna R Panchenko
  • Source
    • "Most, if not all, of our biological functions rely on the activities of proteins, a versatile biomolecule . Proteins' functions depend on the interactions of it with its partner, which may be another protein or other biomolecules (like RNA or DNA or others), or an organic co-factor or inorganic prosthetic group (like Mg ++ , or O 2 , or Zn etc.) [4]. Thus the understandings of the interactions of protein with other partner proteins are very much essential to decipher the bio-molecular pathways *Address correspondence to this author at the Department of Biochemistry and Biophysics, University of Kalyani, Kalyani- 741235, Nadia, West Bengal, India; Tel: +91-9051948843; Fax: +91-33-2582-8282; E-mails:; "
    [Show abstract] [Hide abstract]
    ABSTRACT: Protein-Protein-Interactions (PPIs) are involved in almost all the cellular processes and understanding the structural basis of PPIs remains an important endeavor. The identification of the interface residues may shed light in many important aspects like drug development, elucidation of molecular pathways, generation of protein mimetics and understanding of disease mechanisms as well as development of docking methodologies to build structural models of protein complexes. Over the past few years, advances in high-throughput PPI identification techniques, such as yeast two-hybrid analysis and affinity purification coupled with mass spectrometry, have enabled the researchers to identify sets of interacting proteins in yeast, Droso-phila and other organisms. Unfortunately, these experimental methods do not provide residue level insight into the structure of the interactions between the proteins. The uses of X-Ray crystallography and nuclear magnetic resonance (NMR) spectroscopy to determine the structural basis of an interaction are time consuming and overall expensive. In response to these difficulties, a number of different bioinformatics algorithms with varying degrees of accuracies have been developed that use a wide variety of data sources to predict PPIs and modes of binding between proteins. Machine learning techniques such as Support Vector Machines (SVMs) and Random Forests (RFs) have been used recently to solve problems such as prediction of catalytic residues and prediction and analysis of structure-based PPI interfaces. Previous machine learning approaches to the PPI interface prediction problems used features pertaining to evolutionary amino acid sequence conservation , phylogeny, and GO (Gene Ontology) protein annotation and, in most of the cases, protein structures. Till date there are very few computational methods available that are based solely on protein sequences.
    Full-text · Article · Nov 2015 · Current Chemical Biology
  • Source
    • "The three-dimensional (3D) structure of a protein is key to determine its function (1,2). In order to exploit this relationship, proteins have been divided and classified according to their fold in databases such as SCOP (3). "
    [Show abstract] [Hide abstract]
    ABSTRACT: The function of a protein is determined by its three-dimensional structure, which is formed by regular (i.e. β-strands and α-helices) and non-periodic structural units such as loops. Compared to regular structural elements, non-periodic, non-repetitive conformational units enclose a much higher degree of variability—raising difficulties in the identification of regularities, and yet represent an important part of the structure of a protein. Indeed, loops often play a pivotal role in the function of a protein and different aspects of protein folding and dynamics. Therefore, the structural classification of protein loops is an important subject with clear applications in homology modelling, protein structure prediction, protein design (e.g. enzyme design and catalytic loops) and function prediction. ArchDB, the database presented here (freely available at, represents such a resource and has been an important asset for the scientific community throughout the years. In this article, we present a completely reworked and updated version of ArchDB. The new version of ArchDB features a novel, fast and user-friendly web-based interface, and a novel graph-based, computationally efficient, clustering algorithm. The current version of ArchDB classifies 149,134 loops in 5739 classes and 9608 subclasses.
    Full-text · Article · Nov 2013 · Nucleic Acids Research
  • [Show abstract] [Hide abstract]
    ABSTRACT: Proteins function through their interactions, and the availability of protein interaction networks could help in understanding cellular processes. However, the known structural data are limited and the classical network node-and-edge representation, where proteins are nodes and interactions are edges, shows only which proteins interact; not how they interact. Structural networks provide this information. Protein-protein interface structures can also indicate which binding partners can interact simultaneously and which are competitive, and can help forecasting potentially harmful drug side effects. Here, we use a powerful protein-protein interactions prediction tool which is able to carry out accurate predictions on the proteome scale to construct the structural network of the extracellular signal-regulated kinases (ERK) in the mitogen-activated protein kinase (MAPK) signaling pathway. This knowledge-based method, PRISM, is motif-based, and is combined with flexible refinement and energy scoring. PRISM predicts protein interactions based on structural and evolutionary similarity to known protein interfaces.
    No preview · Article · May 2012 · Current Opinion in Structural Biology
Show more