Effects of protein conformation in docking: improved pose prediction through protein pocket adaptation

Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158-9001, USA.
Journal of Computer-Aided Molecular Design (Impact Factor: 2.78). 05/2009; 23(6):355-74. DOI: 10.1007/s10822-009-9266-3
Source: PubMed

ABSTRACT Computational methods for docking ligands have been shown to be remarkably dependent on precise protein conformation, where acceptable results in pose prediction have been generally possible only in the artificial case of re-docking a ligand into a protein binding site whose conformation was determined in the presence of the same ligand (the "cognate" docking problem). In such cases, on well curated protein/ligand complexes, accurate dockings can be returned as top-scoring over 75% of the time using tools such as Surflex-Dock. A critical application of docking in modeling for lead optimization requires accurate pose prediction for novel ligands, ranging from simple synthetic analogs to very different molecular scaffolds. Typical results for widely used programs in the "cross-docking case" (making use of a single fixed protein conformation) have rates closer to 20% success. By making use of protein conformations from multiple complexes, Surflex-Dock yields an average success rate of 61% across eight pharmaceutically relevant targets. Following docking, protein pocket adaptation and rescoring identifies single pose families that are correct an average of 67% of the time. Consideration of the best of two pose families (from alternate scoring regimes) yields a 75% mean success rate.

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Molecular docking is the most practical approach to leverage protein structure for ligand discovery, but the technique retains important liabilities that make it challenging to deploy on a large scale. We have therefore created an expert system, DOCK Blaster, to investigate the feasibility of full automation. The method requires a PDB code, sometimes with a ligand structure, and from that alone can launch a full screen of large libraries. A critical feature is self-assessment, which estimates the anticipated reliability of the automated screening results using pose fidelity and enrichment. Against common benchmarks, DOCK Blaster recapitulates the crystal ligand pose within 2 A rmsd 50-60% of the time; inferior to an expert, but respectrable. Half the time the ligand also ranked among the top 5% of 100 physically matched decoys chosen on the fly. Further tests were undertaken culminating in a study of 7755 eligible PDB structures. In 1398 cases, the redocked ligand ranked in the top 5% of 100 property-matched decoys while also posing within 2 A rmsd, suggesting that unsupervised prospective docking is viable. DOCK Blaster is available at .
    Journal of Medicinal Chemistry 09/2009; 52(18):5712-20. DOI:10.1021/jm9006966 · 5.48 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Computational methods for predicting ligand affinity where no protein structure is known generally take the form of regression analysis based on molecular features that have only a tangential relationship to a protein/ligand binding event. Such methods have limited utility when structural variation moves beyond congeneric series. We present a novel approach based on the multiple-instance learning method of Compass, where a physical model of a binding site is induced from ligands and their corresponding activity data. The model consists of molecular fragments that can account for multiple positions of literal protein residues. We demonstrate the method on 5HT1a ligands by training on a series with limited scaffold variation and testing on numerous ligands with variant scaffolds. Predictive error was between 0.5 and 1.0 log units (0.7-1.4 kcal/mol), with statistically significant rank correlations. Accurate activity predictions of novel ligands were demonstrated using a validation approach where a small number of ligands of limited structural variation known at a fixed time point were used to make predictions on a blind test set of widely varying molecules, some discovered at a much later time point.
    Journal of Medicinal Chemistry 09/2009; 52(19):6107-25. DOI:10.1021/jm901096y · 5.48 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Background: In the current situation of weak drug pipelines, impending patent expiration of several blockbuster drugs, industry consolidation and changing business models that target special diseases like cancer, diabetes, Alzheimer's and obesity, the pharmaceutical industry is under intense pressure to generate a strong drug pipeline distinguished by better productivity, diversity and cost effectiveness. The goal is discovering high-quality leads in the initial stages of the development cycle, to minimize the costs associated with failures at later ones. Objective: Thus, there is a great amount of interest in further developing and optimizing high-throughput screening and in silico screening, the two methods responsible for generating most of the lead compounds. Although high-throughput screening is the predominant starting point for discovery programs, in silico methods have gradually made inroads by their more rational approach, to expedite the drug discovery and development process. Conclusion: Modern drug discovery strategies include both methods in tandem or in an iterative way. This review primarily provides a succinct overview and comparison of experimental and in silico screening techniques, selected case studies where both methods were used in concert to investigate their performance and complementary nature and a statement on the developments in experimental and in silico approaches in the near future.
    Expert Opinion on Drug Discovery 09/2009; 4(9):947-959. DOI:10.1517/17460440903190961 · 3.47 Impact Factor


Available from