Real spherical harmonic expansion coefficients as 3D shape descriptors for protein binding pocket and ligand comparisons

EMBL-EBI, Cambridge, England, United Kingdom
Bioinformatics (Impact Factor: 4.62). 06/2005; 21(10):2347-55. DOI: 10.1093/bioinformatics/bti337
Source: PubMed

ABSTRACT An increasing number of protein structures are being determined for which no biochemical characterization is available. The analysis of protein structure and function assignment is becoming an unexpected challenge and a major bottleneck towards the goal of well-annotated genomes. As shape plays a crucial role in biomolecular recognition and function, the examination and development of shape description and comparison techniques is likely to be of prime importance for understanding protein structure-function relationships.
A novel technique is presented for the comparison of protein binding pockets. The method uses the coefficients of a real spherical harmonics expansion to describe the shape of a protein's binding pocket. Shape similarity is computed as the L2 distance in coefficient space. Several thousand such comparisons per second can be carried out on a standard Linux PC. Other properties, such as the electrostatic potential, fit seamlessly into the same framework. The method can also be used directly for describing the shape of proteins and other molecules.
A limited version of the software for the real spherical harmonics expansion of a set of points in PDB format is freely available upon request from the authors. Binding pocket comparisons and ligand prediction will be made available through the protein structure annotation pipeline Profunc (written by Roman Laskowski) which will be accessible from the EBI website shortly.
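The comparison described in the abstract can be illustrated with a minimal sketch. This is not the authors' software: it assumes each shape is given as a radial function sampled along unit directions, that both shapes are already in a common orientation, that the expansion is truncated at degree l = 2, and that coefficients are obtained by a least-squares fit. The function names (`fibonacci_sphere`, `real_sh_basis`, `expand`, `shape_distance`) are hypothetical.

```python
import numpy as np


def fibonacci_sphere(n):
    """Return n roughly evenly spread unit vectors as an (n, 3) array."""
    i = np.arange(n) + 0.5
    z = 1.0 - 2.0 * i / n
    phi = np.pi * (1.0 + 5.0 ** 0.5) * i
    s = np.sqrt(1.0 - z * z)
    return np.column_stack((s * np.cos(phi), s * np.sin(phi), z))


def real_sh_basis(dirs):
    """Real spherical harmonics for l = 0..2 at unit vectors, shape (n, 9)."""
    x, y, z = dirs[:, 0], dirs[:, 1], dirs[:, 2]
    c0 = 0.5 * np.sqrt(1.0 / np.pi)
    c1 = np.sqrt(3.0 / (4.0 * np.pi))
    c2 = 0.5 * np.sqrt(15.0 / np.pi)
    c20 = 0.25 * np.sqrt(5.0 / np.pi)
    return np.column_stack((
        np.full_like(x, c0),             # l=0
        c1 * y, c1 * z, c1 * x,          # l=1, m=-1,0,1
        c2 * x * y, c2 * y * z,          # l=2, m=-2,-1
        c20 * (3.0 * z * z - 1.0),       # l=2, m=0
        c2 * x * z,                      # l=2, m=1
        0.5 * c2 * (x * x - y * y),      # l=2, m=2
    ))


def expand(radii, dirs):
    """Least-squares expansion coefficients of a sampled radial function."""
    coeffs, *_ = np.linalg.lstsq(real_sh_basis(dirs), radii, rcond=None)
    return coeffs


def shape_distance(a, b):
    """L2 (Euclidean) distance between two coefficient vectors."""
    return float(np.linalg.norm(a - b))


# Illustrative shapes only: two spheres and an ellipsoid, in arbitrary units.
dirs = fibonacci_sphere(500)
sphere5 = expand(np.full(len(dirs), 5.0), dirs)
sphere6 = expand(np.full(len(dirs), 6.0), dirs)
semi_axes = np.array([6.0, 5.0, 4.0])
ellipsoid_radii = 1.0 / np.sqrt(np.sum((dirs / semi_axes) ** 2, axis=1))
ellipsoid = expand(ellipsoid_radii, dirs)

print(shape_distance(sphere5, sphere6))
print(shape_distance(sphere5, ellipsoid))
```

Once the coefficients are stored, each comparison reduces to a single vector norm, which is consistent with the abstract's point that many comparisons per second are feasible.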

  • ABSTRACT: The paper deals with the identification of binding sites and concentrates on interactions involving small interfaces. In particular we focus our attention on two major interface types, namely protein-ligand and protein-peptide interfaces. For protein-ligand binding site prediction, we classify the most interesting methods and approaches into four main categories: (a) shape-based methods, (b) alignment-based methods, (c) graph-theoretic approaches and (d) machine learning methods. Class (a) encompasses those methods which employ, in some way, geometric information about the protein surface. Methods falling into class (b) address the prediction problem as an alignment problem, i.e. finding protein-ligand atom pairs that occupy spatially equivalent positions. Graph-theoretic approaches, class (c), are mainly based on the definition of a particular graph, known as the protein contact graph, and then apply some sophisticated methods from graph theory to discover subgraphs or score similarities for uncovering functional sites. The last class (d) contains those methods that are based on the learn-from-examples paradigm and that are able to take advantage of the large amount of data available on known protein-ligand pairs. As for protein-peptide interfaces, due to the often disordered nature of the regions involved in binding, shape similarity is no longer a determining factor. Then, in geometry-based methods, geometry is accounted for by providing the relative position of the atoms surrounding the peptide residues in known structures. Finally, also for protein-peptide interfaces, we present a classification of some successful machine learning methods. These can be categorized by the way the learning examples are constructed. In particular, we envisage three main methods: distance functions, structure and potentials, and structure alignment.
    European Physical Journal Plus 06/2014; 129(6). DOI:10.1140/epjp/i2014-14132-1 · 1.48 Impact Factor
  • ABSTRACT: Molecular recognition by proteins plays an important role in their function. Determining which ligand can bind to a protein is a complex matter due to the nature of protein-ligand interactions and the flexibility of binding sites. However, geometric complementarity has often been observed between the ligand and its binding site. Under the assumption that geometrically similar binding sites bind the same ligand, binding sites are mainly studied using three-dimensional and graph-based representations. In this paper, we present a model for two-dimensional representation of ligand binding pockets and we apply it to pocket-pocket matching and binding ligand prediction. This model is based on surface mapping of the binding site and makes use of two-dimensional Pseudo-Zernike descriptors. Our results show that for certain classes of ligands (HEM, NAD, PO4), up to 60% of binding sites are correctly predicted to belong to the right class.
  • ABSTRACT: Structural bioinformatics is an area that has emerged to comprehend and interpret large amounts of structural data, and promises to provide a high-resolution understanding of biology. Protein structures can be compared, analyzed and mined in various ways, which allows us to understand the functions of these molecules and reason precisely how and why such capabilities have emerged in them. The main advantage these methods have over simpler sequence-based methods is that, besides helping to associate a molecule with a function, they also provide insights into the mechanisms by which various biological events take place. This report provides an overview of structural bioinformatics, various advances in recent years and the range and scope of data-driven protein structural analysis. In particular, current trends in structure prediction, structure alignments, deriving sub-structures and structural motifs, understanding features critical for molecular recognition as well as using these for understanding the function of protein molecules are presented. Application of structural knowledge in drug discovery for lead identification as well as novel ways of understanding adverse drug effects and drug resistance are also discussed. Finally, prospects for structure-based vaccine design are outlined. The various aspects of structural bioinformatics discussed here show how biological insights can be obtained from protein structures.
