Real spherical harmonic expansion coefficients as 3D shape descriptors for protein binding pocket and ligand comparisons

EMBL-EBI, Cambridge, England, United Kingdom
Bioinformatics (Impact Factor: 4.62). 06/2005; 21(10):2347-55. DOI: 10.1093/bioinformatics/bti337
Source: PubMed

ABSTRACT An increasing number of protein structures are being determined for which no biochemical characterization is available. The analysis of protein structure and function assignment is becoming an unexpected challenge and a major bottleneck towards the goal of well-annotated genomes. As shape plays a crucial role in biomolecular recognition and function, the examination and development of shape description and comparison techniques is likely to be of prime importance for understanding protein structure-function relationships.
A novel technique is presented for the comparison of protein binding pockets. The method uses the coefficients of a real spherical harmonics expansion to describe the shape of a protein's binding pocket. Shape similarity is computed as the L2 distance in coefficient space. Several thousand such comparisons per second can be carried out on a standard Linux PC. Other properties, such as the electrostatic potential, fit seamlessly into the same framework. The method can also be used directly for describing the shape of proteins and other molecules.
A limited version of the software for the real spherical harmonics expansion of a set of points in PDB format is freely available upon request from the authors. Binding pocket comparisons and ligand prediction will be made available through the protein structure annotation pipeline Profunc (written by Roman Laskowski) which will be accessible from the EBI website shortly.
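
The similarity measure named in the abstract, an L2 distance between expansion-coefficient vectors, can be illustrated with a minimal sketch. This is not the authors' software: the function names `l2_distance` and `rank_by_similarity` and the toy three-element coefficient vectors are assumptions made for illustration, and the step that produces the coefficients (the real spherical harmonics expansion itself) is not shown.

```python
import numpy as np


def l2_distance(a, b):
    """Euclidean (L2) distance between two coefficient vectors."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    if a.shape != b.shape:
        raise ValueError("coefficient vectors must have the same length")
    return float(np.linalg.norm(a - b))


def rank_by_similarity(query, library):
    """Sort library entries from most to least similar to the query.

    Returns (order, distances), where order holds library indices and
    distances holds the L2 distance of each library entry to the query.
    """
    library = np.asarray(library, dtype=float)
    query = np.asarray(query, dtype=float)
    # One vectorised pass over all library rows.
    distances = np.linalg.norm(library - query, axis=1)
    return np.argsort(distances, kind="stable"), distances


# Hypothetical coefficient vectors; real descriptors would be much longer.
query = [1.0, 0.0, 2.0]
library = [[1.0, 0.0, 2.0], [4.0, 4.0, 2.0], [1.0, 1.0, 2.0]]
order, distances = rank_by_similarity(query, library)
```

Because each comparison reduces to a subtraction and a norm over fixed-length vectors, a whole library can be scored in one array operation, which is in keeping with the high comparison rate the abstract reports.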


Available from: Abdullah Kahraman, Jun 29, 2015
  • Source
    ABSTRACT: Modern developments in light microscopy have allowed the observation of cell deformation with remarkable spatiotemporal resolution and reproducibility. Analyzing such phenomena is of particular interest for the signal processing and computer vision communities due to the numerous computational challenges involved, from image acquisition all the way to shape analysis and pattern recognition and interpretation. This article aims at providing an up-to-date overview of the problems, solutions, and remaining challenges in deciphering the morphology of living cells via computerized approaches, with a particular focus on shape description frameworks and their exploitation using machine-learning techniques. As a concrete illustration, we use our recently acquired data on amoeboid cell deformation, motivated by its direct implication in immune responses, bacterial invasion, and cancer metastasis.
    IEEE Signal Processing Magazine 01/2015; 32(1):30-40. DOI:10.1109/MSP.2014.2359131 · 4.48 Impact Factor
  • Source
    ABSTRACT: MOTIVATION: Receptor-ligand interactions are a central phenomenon in most biological systems. They are characterized by molecular recognition, a complex process mainly driven by physicochemical and structural properties of both receptor and ligand. Understanding and predicting these interactions are major steps towards protein ligand prediction, target identification, lead discovery and drug design. RESULTS: We propose a novel graph-based binding pocket signature called aCSM, which proved to be efficient and effective in handling large-scale protein ligand prediction tasks. We compare our results with those described in the literature and demonstrate that our algorithm outperforms competing techniques. Finally, we predict novel ligands for proteins from Trypanosoma cruzi, the parasite responsible for Chagas disease, and validate them in silico via a docking protocol, showing the applicability of the method in suggesting ligands for pockets in a real-world scenario. AVAILABILITY AND IMPLEMENTATION: Data sets and the source code are available at CONTACT: and
    Bioinformatics 02/2013; 29(7). DOI:10.1093/bioinformatics/btt058 · 4.62 Impact Factor
  • Source
    ABSTRACT: Drug effects are mainly caused by the interactions between drug molecules and their target proteins, including primary targets and off-targets. Identification of the molecular mechanisms behind overall drug-target interactions is crucial in the drug design process. We develop a classifier-based approach to identify chemogenomic features (the underlying associations between drug chemical substructures and protein domains) that are involved in drug-target interaction networks. We propose a novel algorithm for extracting informative chemogenomic features by using L1-regularized classifiers over the tensor product space of possible drug-target pairs. It is shown that the proposed method can extract a very limited number of chemogenomic features without losing performance in predicting drug-target interactions, and that the extracted features are biologically meaningful. The extracted substructure-domain association network enables us to suggest ligand chemical fragments specific for each protein domain and ligand core substructures important for a wide range of protein families. Software is available at the supplemental website. Datasets and all results are available at .
    Bioinformatics 09/2012; 28(18):i487-i494. DOI:10.1093/bioinformatics/bts412 · 4.62 Impact Factor