Improved prediction of critical residues for protein function based on network and phylogenetic analyses

Buck Institute For Age Research, 8001 Redwood Blvd, Novato, CA 94945, USA.
BMC Bioinformatics (Impact Factor: 2.67). 02/2005; 6:213. DOI: 10.1186/1471-2105-6-213
Source: PubMed

ABSTRACT Phylogenetic approaches are commonly used to predict which amino acid residues are critical to the function of a given protein. However, such approaches display inherent limitations, such as the requirement for identification of multiple homologues of the protein under consideration. Therefore, complementary or alternative approaches for the prediction of critical residues would be desirable. Network analyses have been used in the modelling of many complex biological systems, but only very recently have they been used to predict critical residues from a protein's three-dimensional structure. Here we compare a couple of phylogenetic approaches to several different network-based methods for the prediction of critical residues, and show that a combination of one phylogenetic method and one network-based method is superior to other methods previously employed.
We associate a network with each member of a set of proteins for which the three-dimensional structure is known and the critical residues have been previously determined experimentally. We show that several network-based centrality measurements (connectivity, 2-connectivity, closeness centrality, betweenness and cluster coefficient) accurately detect residues critical for the protein's function. Phylogenetic approaches render predictions as reliable as the network-based measurements, although, interestingly, the two general approaches tend to predict different sets of critical residues. Hence we propose a hybrid method that is composed of one network-based calculation--the closeness centrality--and one phylogenetic approach--the Conseq server. This hybrid approach predicts critical residues more accurately than the other methods tested here.
We show that network analysis can be used to improve the prediction of amino acids critical for protein function, when utilized in combination with phylogenetic approaches. It is proposed that such improvement is due to the complementary nature of these approaches: network-based methods tend to predict as critical those residues that are highly connected and internal (i.e., non-surface), although some surface residues are indeed identified as critical by network analyses; whereas residues chosen by phylogenetic approaches display a lower overall probability of being surface inaccessible.

  • [Show abstract] [Hide abstract]
    ABSTRACT: Computational studies of allosteric interactions have witnessed a recent renaissance fueled by growing interest in the modeling of complex molecular assemblies and biological networks. Allosteric interactions of the molecular chaperone Hsp90 with a diverse array of cochaperones and client proteins allow for molecular communication in signal transduction networks. In this review, recent developments in the understanding of allosteric interactions in the context of structural, functional, and computational studies of the Hsp90 chaperone are discussed. A comprehensive analysis of structural and network-based models of protein allostery is provided. Computational and experimental approaches and advances in the understanding of Hsp90 interactions and regulatory mechanisms are reviewed to provide a systematic and critical view of the current progress and most challenging questions in the field. The current status and future prospects for translational research, bridging the basic science of chaperones with the discovery of anti-cancer therapies, are also highlighted.
    Israel Journal of Chemistry (Online) 08/2014; 54(8-9). DOI:10.1002/ijch.201300143 · 2.56 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Relating a gene mutation to a phenotype is a common task in different disciplines such as protein biochemistry. In this endeavour, it is common to find false relationships arising from mutations introduced by cells that may be depurated using a phenotypic assay; yet, such phenotypic assays may introduce additional false relationships arising from experimental errors. Here we introduce the use of high-throughput DNA sequencers and statistical analysis aimed to identify incorrect DNA sequence-phenotype assignments and observed that 10-20% of these false assignments are expected in large screenings aimed to identify critical residues for protein function. We further show that this level of incorrect DNA sequence-phenotype assignments may significantly alter our understanding about the structure-function relationship of proteins. We have made available an implementation of our method at
    PLoS ONE 02/2015; 10(2):e0118288. DOI:10.1371/journal.pone.0118288 · 3.53 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: G protein-coupled receptors (GPCRs) are a superfamily of membrane proteins of vast pharmaceutical interest. Here, we describe a graph theory-based analysis of the structure of the β2 adrenergic receptor (β2 AR), a prototypical GPCR. In particular, we illustrate the network of direct and indirect interactions that link each amino acid residue to any other residue of the receptor. Networks of interconnected amino acid residues in proteins are analogous to social networks of interconnected people. Hence, they can be studied through the same analysis tools typically employed to analyze social networks - or networks in general - to reveal patterns of connectivity, influential members, and dynamicity. We focused on the analysis of closeness-centrality, which is a measure of the overall connectivity distance of the member of a network to all other members. The residues endowed with the highest closeness-centrality are located in the middle of the seven transmembrane domains (TMs). In particular, they are mostly located in the middle of TM2, TM3, TM6 or TM7, while fewer of them are located in the middle of TM1, TM4 or TM5. At the cytosolic end of TM6, the centrality detected for the active structure is markedly lower than that detected for the corresponding residues in the inactive structures. Moreover, several residues acquire centrality when the structures are analyzed in the presence of ligands. Strikingly, there is little overlap between the residues that acquire centrality in the presence of the ligand in the blocker-bound structures and the agonist-bound structures. Our results reflect the fact that the receptor resembles a bow tie, with a rather tight knot of closely interconnected residues and two ends that fan out in two opposite directions: one toward the extracellular space, which hosts the ligand binding cavity, and one toward the cytosol, which hosts the G protein binding cavity. Moreover, they underscore how interaction network is by the conformational rearrangements concomitant with the activation of the receptor and by the presence of agonists or blockers.


1 Download
Available from