Publications (8)35.59 Total impact
-
Article: Computational tools and resources for metabolism-related property predictions. 1. Overview of publicly available (free and commercial) databases and software.
[show abstract] [hide abstract]
ABSTRACT: Metabolism has been identified as a defining factor in drug development success or failure because of its impact on many aspects of drug pharmacology, including bioavailability, half-life and toxicity. In this article, we provide an outline and descriptions of the resources for metabolism-related property predictions that are currently either freely or commercially available to the public. These resources include databases with data on, and software for prediction of, several end points: metabolite formation, sites of metabolic transformation, binding to metabolizing enzymes and metabolic stability. We attempt to place each tool in historical context and describe, wherever possible, the data it was based on. For predictions of interactions with metabolizing enzymes, we show a typical set of results for a small test set of compounds. Our aim is to give a clear overview of the areas and aspects of metabolism prediction in which the currently available resources are useful and accurate, and the areas in which they are inadequate or missing entirely.Future medicinal chemistry 10/2012; 4(15):1907-32. · 2.52 Impact Factor -
Article: QSAR classification model for antibacterial compounds and its use in virtual screening.
[show abstract] [hide abstract]
ABSTRACT: As novel and drug-resistant bacterial strains continue to present an emerging health threat, the development of new antibacterial agents is critical. This includes making improvements to existing antibacterial scaffolds as well as identifying novel ones. The aim of this study is to apply a Bayesian classification QSAR approach to rapidly screen chemical libraries for compounds predicted to have antibacterial activity. Toward this end we assembled a data set of 317 known antibacterial compounds as well as a second data set of diverse, well-validated, non-antibacterial compounds from 215 PubChem Bioassays against various bacterial species. We constructed a Bayesian classification model using structural fingerprints and physicochemical property descriptors and achieved an accuracy of 84% and precision of 86% on an independent test set in identifying antibacterial compounds. To demonstrate the practical applicability of the model in virtual screening, we screened an independent data set of ∼200k compounds. The results show that the model can screen top hits of PubChem Bioassay actives with accuracy up to ∼76%, representing a 1.5-2-fold enrichment. The top screened hits represented a mixture of both known antibacterial scaffolds as well as novel scaffolds. Our study suggests that a well-validated Bayesian classification QSAR approach could compliment other screening approaches in identifying novel and promising hits. The data sets used in constructing and validating this model have been made publicly available.Journal of Chemical Information and Modeling 09/2012; 52(10):2559-69. · 4.68 Impact Factor -
Article: Locally weighted learning methods for predicting dose-dependent toxicity with application to the human maximum recommended daily dose.
[show abstract] [hide abstract]
ABSTRACT: Toxicological experiments in animals are carried out to determine the type and severity of any potential toxic effect associated with a new lead compound. The collected data are then used to extrapolate the effects on humans and determine initial dose regimens for clinical trials. The underlying assumption is that the severity of the toxic effects in animals is correlated with that in humans. However, there is a general lack of toxic correlations across species. Thus, it is more advantageous to predict the toxicological effects of a compound on humans directly from the human toxicological data of related compounds. However, many popular quantitative structure-activity relationship (QSAR) methods that build a single global model by fitting all training data appear inappropriate for predicting toxicological effects of structurally diverse compounds because the observed toxicological effects may originate from very different and mostly unknown molecular mechanisms. In this article, we demonstrate, via application to the human maximum recommended daily dose data that locally weighted learning methods, such as k-nearest neighbors, are well suited for predicting toxicological effects of structurally diverse compounds. We also show that a significant flaw of the k-nearest neighbor method is that it always uses a constant number of nearest neighbors in making prediction for a target compound, irrespective of whether the nearest neighbors are structurally similar enough to the target compound to ensure that they share the same mechanism of action. To remedy this flaw, we proposed and implemented a variable number nearest neighbor method. The advantages of the variable number nearest neighbor method over other QSAR methods include (1) allowing more reliable predictions to be achieved by applying a tighter molecular distance threshold and (2) automatic detection for when a prediction should not be made because the compound is outside the applicable domain.Chemical Research in Toxicology 09/2012; 25(10):2216-26. · 3.78 Impact Factor -
Article: Quantitative predictions of binding free energy changes in drug-resistant influenza neuraminidase.
[show abstract] [hide abstract]
ABSTRACT: Quantitatively predicting changes in drug sensitivity associated with residue mutations is a major challenge in structural biology. By expanding the limits of free energy calculations, we successfully identified mutations in influenza neuraminidase (NA) that confer drug resistance to two antiviral drugs, zanamivir and oseltamivir. We augmented molecular dynamics (MD) with Hamiltonian Replica Exchange and calculated binding free energy changes for H274Y, N294S, and Y252H mutants. Based on experimental data, our calculations achieved high accuracy and precision compared with results from established computational methods. Analysis of 15 µs of aggregated MD trajectories provided insights into the molecular mechanisms underlying drug resistance that are at odds with current interpretations of the crystallographic data. Contrary to the notion that resistance is caused by mutant-induced changes in hydrophobicity of the binding pocket, our simulations showed that drug resistance mutations in NA led to subtle rearrangements in the protein structure and its dynamics that together alter the active-site electrostatic environment and modulate inhibitor binding. Importantly, different mutations confer resistance through different conformational changes, suggesting that a generalized mechanism for NA drug resistance is unlikely.PLoS Computational Biology 08/2012; 8(8):e1002665. · 5.22 Impact Factor -
Article: 2D SMARTCyp reactivity-based site of metabolism prediction for major drug-metabolizing cytochrome P450 enzymes.
[show abstract] [hide abstract]
ABSTRACT: Cytochrome P450 (CYP) 3A4, 2D6, 2C9, 2C19, and 1A2 are the most important drug-metabolizing enzymes in the human liver. Knowledge of which parts of a drug molecule are subject to metabolic reactions catalyzed by these enzymes is crucial for rational drug design to mitigate ADME/toxicity issues. SMARTCyp, a recently developed 2D ligand structure-based method, is able to predict site-specific metabolic reactivity of CYP3A4 and CYP2D6 substrates with an accuracy that rivals the best and more computationally demanding 3D structure-based methods. In this article, the SMARTCyp approach was extended to predict the metabolic hotspots for CYP2C9, CYP2C19, and CYP1A2 substrates. This was accomplished by taking into account the impact of a key substrate-receptor recognition feature of each enzyme as a correction term to the SMARTCyp reactivity. The corrected reactivity was then used to rank order the likely sites of CYP-mediated metabolic reactions. For 60 CYP1A2 substrates, the observed major sites of CYP1A2 catalyzed metabolic reactions were among the top-ranked 1, 2, and 3 positions in 67%, 80%, and 83% of the cases, respectively. The results were similar to those obtained by MetaSite and the reactivity + docking approach. For 70 CYP2C9 substrates, the observed sites of CYP2C9 metabolism were among the top-ranked 1, 2, and 3 positions in 66%, 86%, and 87% of the cases, respectively. These results were better than the corresponding results of StarDrop version 5.0, which were 61%, 73%, and 77%, respectively. For 36 compounds metabolized by CYP2C19, the observed sites of metabolism were found to be among the top-ranked 1, 2, and 3 sites in 78%, 89%, and 94% of the cases, respectively. The computational procedure was implemented as an extension to the program SMARTCyp 2.0. With the extension, the program can now predict the site of metabolism for all five major drug-metabolizing enzymes with an accuracy similar to or better than that achieved by the best 3D structure-based methods. Both the Java source code and the binary executable of the program are freely available to interested users.Journal of Chemical Information and Modeling 05/2012; 52(6):1698-712. · 4.68 Impact Factor -
Article: A physicochemical descriptor-based scoring scheme for effective and rapid filtering of kinase-like chemical space.
[show abstract] [hide abstract]
ABSTRACT: The current chemical space of known small molecules is estimated to exceed 1060 structures. Though the largest physical compound repositories contain only a few tens of millions of unique compounds, virtual screening of databases of this size is still difficult. In recent years, the application of physicochemical descriptor-based profiling, such as Lipinski's rule-of-five for drug-likeness and Oprea's criteria of lead-likeness, as early stage filters in drug discovery has gained widespread acceptance. In the current study, we outline a kinase-likeness scoring function based on known kinase inhibitors. The method employs a collection of 22,615 known kinase inhibitors from the ChEMBL database. A kinase-likeness score is computed using statistical analysis of nine key physicochemical descriptors for these inhibitors. Based on this score, the kinase-likeness of four publicly and commercially available databases, i.e., National Cancer Institute database (NCI), the Natural Products database (NPD), the National Institute of Health's Molecular Libraries Small Molecule Repository (MLSMR), and the World Drug Index (WDI) database, is analyzed. Three of these databases, i.e., NCI, NPD, and MLSMR are frequently used in the virtual screening of kinase inhibitors, while the fourth WDI database is for comparison since it covers a wide range of known chemical space. Based on the kinase-likeness score, a kinase-focused library is also developed and tested against three different kinase targets selected from three different branches of the human kinome tree. Our proposed methodology is one of the first that explores how the narrow chemical space of kinase inhibitors and its relevant physicochemical information can be utilized to build kinase-focused libraries and prioritize pre-existing compound databases for screening. We have shown that focused libraries generated by filtering compounds using the kinase-likeness score have, on average, better docking scores than an equivalent number of randomly selected compounds. Beyond library design, our findings also impact the broader efforts to identify kinase inhibitors by screening pre-existing compound libraries. Currently, the NCI library is the most commonly used database for screening kinase inhibitors. Our research suggests that other libraries, such as MLSMR, are more kinase-like and should be given priority in kinase screenings.Journal of Cheminformatics 02/2012; 4(1):4. · 3.42 Impact Factor -
Article: Exploring polypharmacology using a ROCS-based target fishing approach.
[show abstract] [hide abstract]
ABSTRACT: Polypharmacology has emerged as a new theme in drug discovery. In this paper, we studied polypharmacology using a ligand-based target fishing (LBTF) protocol. To implement the protocol, we first generated a chemogenomic database that links individual protein targets with a specified set of drugs or target representatives. Target profiles were then generated for a given query molecule by computing maximal shape/chemistry overlap between the query molecule and the drug sets assigned to each protein target. The overlap was computed using the program ROCS (Rapid Overlay of Chemical Structures). We validated this approach using the Directory of Useful Decoys (DUD). DUD contains 2950 active compounds, each with 36 property-matched decoys, against 40 protein targets. We chose a set of known drugs to represent each DUD target, and we carried out ligand-based virtual screens using data sets of DUD actives seeded into DUD decoys for each target. We computed Receiver Operator Characteristic (ROC) curves and associated area under the curve (AUC) values. For the majority of targets studied, the AUC values were significantly better than for the case of a random selection of compounds. In a second test, the method successfully identified off-targets for drugs such as rimantadine, propranolol, and domperidone that were consistent with those identified by recent experiments. The results from our ROCS-based target fishing approach are promising and have potential application in drug repurposing for single and multiple targets, identifying targets for orphan compounds, and adverse effect prediction.Journal of Chemical Information and Modeling 12/2011; 52(2):492-505. · 4.68 Impact Factor -
Article: Classification of scaffold-hopping approaches.
[show abstract] [hide abstract]
ABSTRACT: The general goal of drug discovery is to identify novel compounds that are active against a preselected biological target with acceptable pharmacological properties defined by marketed drugs. Scaffold hopping has been widely applied by medicinal chemists to discover equipotent compounds with novel backbones that have improved properties. In this article we classify scaffold hopping into four major categories, namely heterocycle replacements, ring opening or closure, peptidomimetics and topology-based hopping. We review the structural diversity of original and final scaffolds with respect to each category. We discuss the advantages and limitations of small, medium and large-step scaffold hopping. Finally, we summarize software that is frequently used to facilitate different kinds of scaffold-hopping methods.Drug discovery today 10/2011; 17(7-8):310-24. · 6.63 Impact Factor
Top Journals
Institutions
-
2012
-
Biotechnology High Performance Computing Software Applications Institute
Frederick, MD, USA
-
-
2011
-
National Institutes of Health
Bethesda, MD, USA
-