Christine Humblet

Cubist Pharmaceuticals, Lexington, MA, USA

Are you Christine Humblet?

Claim your profile

Publications (25)68.47 Total impact

  • Article: Chemical space sampling in virtual screening by different crystal structures.
    Natasja Brooijmans, Christine Humblet
    [show abstract] [hide abstract]
    ABSTRACT: Retrospective virtual screening experiments were carried out to investigate the effects of combining hit lists from different crystal structures of the same target using consensus scoring. An in-house High Throughput Screening (HTS) dataset from PI3K-γ was used and docked against five diverse PI3K-γ crystal structures. The results show that consensus scoring prioritizes compounds that score moderately against individual crystal structures and is thus complementary to individual crystal structure screening leading to an increase in the diversity of hits. Enrichment factors (EFs) of the consensus score for two or three structures are often as high as or higher than the EF of the individual structures used in the consensus score. Combining four or five structures in the consensus score generally yields lower enrichments. Compounds in the top 500 of the consensus score that are also found in the top 500 of an individual X-ray structure used in the consensus score calculations yield the largest number of hits with the lowest number of false positives.
    Chemical Biology &amp Drug Design 10/2010; 76(6):472-9. · 2.28 Impact Factor
  • Article: Biased retrieval of chemical series in receptor-based virtual screening.
    Natasja Brooijmans, Jason B Cross, Christine Humblet
    [show abstract] [hide abstract]
    ABSTRACT: Using the kinases in the DUD dataset and an in-house HTS dataset from PI3K-γ, receptor-based virtual screening experiments were performed using Glide SP docking. While significant enrichments were observed for eight of the nine targets in the set, more detailed analyses highlighted that much of the early enrichment (10-80%) is the result of retrieval of a single cluster of active compounds. This biased retrieval was not necessarily due to early enrichment of the cluster containing the co-crystallized ligand. Virtual screening validation studies could thus benefit from including cluster-based analyses to assess enrichment of diverse chemotypes.
    Journal of Computer-Aided Molecular Design 10/2010; 24(12):1053-62. · 3.39 Impact Factor
  • Article: Computational alanine scanning with linear scaling semiempirical quantum mechanical methods.
    [show abstract] [hide abstract]
    ABSTRACT: Alanine scanning is a powerful experimental tool for understanding the key interactions in protein-protein interfaces. Linear scaling semiempirical quantum mechanical calculations are now sufficiently fast and robust to allow meaningful calculations on large systems such as proteins, RNA and DNA. In particular, they have proven useful in understanding protein-ligand interactions. Here we ask the question: can these linear scaling quantum mechanical methods developed for protein-ligand scoring be useful for computational alanine scanning? To answer this question, we assembled 15 protein-protein complexes with available crystal structures and sufficient alanine scanning data. In all, the data set contains Delta Delta Gs for 400 single point alanine mutations of these 15 complexes. We show that with only one adjusted parameter the quantum mechanics-based methods outperform both buried accessible surface area and a potential of mean force and compare favorably to a variety of published empirical methods. Finally, we closely examined the outliers in the data set and discuss some of the challenges that arise from this examination.
    Proteins Structure Function and Bioinformatics 08/2010; 78(10):2329-37. · 3.39 Impact Factor
  • Article: A protein relational database and protein family knowledge bases to facilitate structure-based design analyses.
    [show abstract] [hide abstract]
    ABSTRACT: The Protein Data Bank is the most comprehensive source of experimental macromolecular structures. It can, however, be difficult at times to locate relevant structures with the Protein Data Bank search interface. This is particularly true when searching for complexes containing specific interactions between protein and ligand atoms. Moreover, searching within a family of proteins can be tedious. For example, one cannot search for some conserved residue as residue numbers vary across structures. We describe herein three databases, Protein Relational Database, Kinase Knowledge Base, and Matrix Metalloproteinase Knowledge Base, containing protein structures from the Protein Data Bank. In Protein Relational Database, atom-atom distances between protein and ligand have been precalculated allowing for millisecond retrieval based on atom identity and distance constraints. Ring centroids, centroid-centroid and centroid-atom distances and angles have also been included permitting queries for pi-stacking interactions and other structural motifs involving rings. Other geometric features can be searched through the inclusion of residue pair and triplet distances. In Kinase Knowledge Base and Matrix Metalloproteinase Knowledge Base, the catalytic domains have been aligned into common residue numbering schemes. Thus, by searching across Protein Relational Database and Kinase Knowledge Base, one can easily retrieve structures wherein, for example, a ligand of interest is making contact with the gatekeeper residue.
    Chemical Biology &amp Drug Design 08/2010; 76(2):142-53. · 2.28 Impact Factor
  • Article: Insights for predicting blood-brain barrier penetration of CNS targeted molecules using QSPR approaches.
    [show abstract] [hide abstract]
    ABSTRACT: Due to the high attrition rate of central nervous system drug candidates during clinical trials, the assessment of blood-brain barrier (BBB) penetration in early research is particularly important. A genetic approximation (GA)-based regression model was developed for predicting in vivo blood-brain partitioning data, expressed as logBB (log[brain]/[blood]). The model was built using an in-house data set of 193 compounds assembled from 22 different therapeutic projects. The final model (cross-validated r(2) = 0.72) with five molecular descriptors was selected based on validation using several large internal and external test sets. We demonstrate the potential utility of the model by applying it to a set of literature reported secretase inhibitors. In addition, we describe a rule-based approach for rapid assessment of brain penetration with several simple molecular descriptors.
    Journal of Chemical Information and Modeling 06/2010; 50(6):1123-33. · 4.68 Impact Factor
  • Article: Chemical space sampling by different scoring functions and crystal structures.
    Natasja Brooijmans, Christine Humblet
    [show abstract] [hide abstract]
    ABSTRACT: Virtual screening has become a popular tool to identify novel leads in the early phases of drug discovery. A variety of docking and scoring methods used in virtual screening have been the subject of active research in an effort to gauge limitations and articulate best practices. However, how to best utilize different scoring functions and various crystal structures, when available, is not yet well understood. In this work we use multiple crystal structures of PI3 K-gamma in both prospective and retrospective virtual screening experiments. Both Glide SP scoring and Prime MM-GBSA rescoring are utilized in the prospective and retrospective virtual screens, and consensus scoring is investigated in the retrospective virtual screening experiments. The results show that each of the different crystal structures that was used, samples a different chemical space, i.e. different chemotypes are prioritized by each structure. In addition, the different (re)scoring functions prioritize different chemotypes as well. Somewhat surprisingly, the Prime MM-GBSA scoring function generally gives lower enrichments than Glide SP. Finally we investigate the impact of different ligand preparation protocols on virtual screening enrichment factors. In summary, different crystal structures and different scoring functions are complementary to each other and allow for a wider variety of chemotypes to be considered for experimental follow-up.
    Journal of Computer-Aided Molecular Design 05/2010; 24(5):433-47. · 3.39 Impact Factor
  • Article: Using a homology model of cytochrome P450 2D6 to predict substrate site of metabolism.
    [show abstract] [hide abstract]
    ABSTRACT: CYP2D6 is an important enzyme that is involved in first pass metabolism and is responsible for metabolizing ~25% of currently marketed drugs. A homology model of CYP2D6 was built using X-ray structures of ligand-bound CYP2C5 complexes as templates. This homology model was used in docking studies to rationalize and predict the site of metabolism of known CYP2D6 substrates. While the homology model was generally found to be in good agreement with the recently solved apo (ligand-free) X-ray structure of CYP2D6, significant differences between the structures were observed in the B' and F-G helical region. These structural differences are similar to those observed between ligand-free and ligand-bound structures of other CYPs and suggest that these conformational changes result from induced-fit adaptations upon ligand binding. By docking to the homology model using Glide, it was possible to identify the correct site of metabolism for a set of 16 CYP2D6 substrates 85% of the time when the 5 top scoring poses were examined. On the other hand, docking to the apo CYP2D6 X-ray structure led to a loss in accuracy in predicting the sites of metabolism for many of the CYP2D6 substrates considered in this study. These results demonstrate the importance of describing substrate-induced conformational changes that occur upon binding. The best results were obtained using Glide SP with van der Waals scaling set to 0.8 for both the receptor and ligand atoms. A discussion of putative binding modes that explain the distribution of metabolic sites for substrates, as well as a relationship between the number of metabolic sites and substrate size, are also presented. In addition, analysis of these binding modes enabled us to rationalize the typical hydroxylation and O-demethylation reactions catalyzed by CYP2D6 as well as the less common N-dealkylation.
    Journal of Computer-Aided Molecular Design 04/2010; 24(3):237-56. · 3.39 Impact Factor
  • Article: An enriched structural kinase database to enable kinome-wide structure-based analyses and drug discovery.
    [show abstract] [hide abstract]
    ABSTRACT: The development of a kinase structural database, the kinase knowledge base (KKB), is described. It covers all human kinase domain structures that have been deposited in the Protein Data Bank. All structures are renumbered using a common scheme, which enables efficient cross-comparisons and multiple queries of interest to the kinase field. The common numbering scheme is also used to automatically annotate conserved residues and motifs, and conformationally classify the structures based on the DFG-loop and Helix C. Analyses of residue conservation in the ATP binding site using the full human-kinome-sequence alignment lead to the identification of a conserved hydrogen bond between the hinge region backbone and a glycine in the specificity surface. Furthermore, 90% of kinases are found to have at least one stabilizing interaction for the hinge region, which has not been described before.
    Protein Science 02/2010; 19(4):763-74. · 2.80 Impact Factor
  • Article: Development of QSAR models for microsomal stability: identification of good and bad structural features for rat, human and mouse microsomal stability.
    Journal of Computer-Aided Molecular Design. 01/2010; 24:23-35.
  • Article: Escape from flatland: increasing saturation as an approach to improving clinical success.
    Frank Lovering, Jack Bikker, Christine Humblet
    [show abstract] [hide abstract]
    ABSTRACT: The medicinal chemistry community has become increasingly aware of the value of tracking calculated physical properties such as molecular weight, topological polar surface area, rotatable bonds, and hydrogen bond donors and acceptors. We hypothesized that the shift to high-throughput synthetic practices over the past decade may be another factor that may predispose molecules to fail by steering discovery efforts toward achiral, aromatic compounds. We have proposed two simple and interpretable measures of the complexity of molecules prepared as potential drug candidates. The first is carbon bond saturation as defined by fraction sp(3) (Fsp(3)) where Fsp(3) = (number of sp(3) hybridized carbons/total carbon count). The second is simply whether a chiral carbon exists in the molecule. We demonstrate that both complexity (as measured by Fsp(3)) and the presence of chiral centers correlate with success as compounds transition from discovery, through clinical testing, to drugs. In an attempt to explain these observations, we further demonstrate that saturation correlates with solubility, an experimental physical property important to success in the drug discovery setting.
    Journal of Medicinal Chemistry 11/2009; 52(21):6752-6. · 4.80 Impact Factor
  • Article: Development of QSAR models for microsomal stability: identification of good and bad structural features for rat, human and mouse microsomal stability.
    [show abstract] [hide abstract]
    ABSTRACT: High throughput microsomal stability assays have been widely implemented in drug discovery and many companies have accumulated experimental measurements for thousands of compounds. Such datasets have been used to develop in silico models to predict metabolic stability and guide the selection of promising candidates for synthesis. This approach has proven most effective when selecting compounds from proposed virtual libraries prior to synthesis. However, these models are not easily interpretable at the structural level, and thus provide little insight to guide traditional synthetic efforts. We have developed global classification models of rat, mouse and human liver microsomal stability using in-house data. These models were built with FCFP_6 fingerprints using a Naïve Bayesian classifier within Pipeline Pilot. The test sets were correctly classified as stable or unstable with satisfying accuracies of 78, 77 and 75% for rat, human and mouse models, respectively. The prediction confidence was assigned using the Bayesian score to assess the applicability of the models. Using the resulting models, we developed a novel data mining strategy to identify structural features associated with good and bad microsomal stability. We also used this approach to identify structural features which are good for one species but bad for another. With these findings, the structure-metabolism relationships are likely to be understood faster and earlier in drug discovery.
    Journal of Computer-Aided Molecular Design 11/2009; 24(1):23-35. · 3.39 Impact Factor
  • Article: Computation of 3D queries for ROCS based virtual screens.
    Gregory J Tawa, J Christian Baber, Christine Humblet
    [show abstract] [hide abstract]
    ABSTRACT: Rapid overlay of chemical structures (ROCS) is a method that aligns molecules based on shape and/or chemical similarity. It is often used in 3D ligand-based virtual screening. Given a query consisting of a single conformation of an active molecule ROCS can generate highly enriched hit lists. Typically the chosen query conformation is a minimum energy structure. Can better enrichment be obtained using conformations other than the minimum energy structure? To answer this question a methodology has been developed called CORAL (COnformational analysis, Rocs ALignment). For a given set of molecule conformations it computes optimized conformations for ROCS screening. It does so by clustering all conformations of a chosen molecule set using pairwise ROCS combo scores. The best representative conformation is that which has the highest average overlap with the rest of the conformations in the cluster. It is these best representative conformations that are then used for virtual screening. CORAL was tested by performing virtual screening experiments with the 40 DUD (Directory of Useful Decoys) data sets. Both CORAL and minimum energy queries were used. The recognition capability of each query was quantified as the area under the ROC curve (AUC). Results show that the CORAL AUC values are on average larger than the minimum energy AUC values. This demonstrates that one can indeed obtain better ROCS enrichments with conformations other than the minimum energy structure. As a result, CORAL analysis can be a valuable first step in virtual screening workflows using ROCS.
    Journal of Computer-Aided Molecular Design 09/2009; 23(12):853-68. · 3.39 Impact Factor
  • Article: GARD: a Generally Applicable Replacement for RMSD.
    [show abstract] [hide abstract]
    ABSTRACT: The root-mean-squared deviation (rmsd) is a widely used measure of distance between two aligned objects -- often chemical structures. However, rmsd has a number of known limitations including difficulty of interpretation, no limit on weighting for any portion of the alignment, and a lack of normalization. In this work, a Generally Applicable Replacement for rmsD (GARD) is proposed. In this implementation atomic contributions are weighted by their relative importance to binding, as determined statistically by Andrews et al. (1) , and as such this method is 'chemically aware'. This novel measure is normalized and does not have many of the failings of traditional rmsd. It is, thus, perfectly suited for a wide variety of uses, including the assessment of the quality of poses produced from molecular docking programs and the comparison of conformers. Rmsd and GARD are compared in their ability to assess docking software and multiple examples of the use of GARD to rescue essentially correct poses with a high rmsd are presented.
    Journal of Chemical Information and Modeling 08/2009; 49(8):1889-900. · 4.68 Impact Factor
  • Article: Modeling G protein-coupled receptors for structure-based drug discovery using low-frequency normal modes for refinement of homology models: application to H3 antagonists.
    [show abstract] [hide abstract]
    ABSTRACT: G Protein-Coupled Receptors (GPCRs) are integral membrane proteins that play important role in regulating key physiological functions, and are targets of about 50% of all recently launched drugs. High-resolution experimental structures are available only for very few GPCRs. As a result, structure-based drug design efforts for GPCRs continue to rely on in silico modeling, which is considered to be an extremely difficult task especially for these receptors. Here, we describe Gmodel, a novel approach for building 3D atomic models of GPCRs using a normal mode-based refinement of homology models. Gmodel uses a small set of relevant low-frequency vibrational modes derived from Random Elastic Network model to efficiently sample the large-scale receptor conformation changes and generate an ensemble of alternative models. These are used to assemble receptor-ligand complexes by docking a known active into each of the alternative models. Each of these is next filtered using restraints derived from known mutation and binding affinity data and is refined in the presence of the active ligand. In this study, Gmodel was applied to generate models of the antagonist form of histamine 3 (H3) receptor. The validity of this novel modeling approach is demonstrated by performing virtual screening (using the refined models) that consistently produces highly enriched hit lists. The models are further validated by analyzing the available SAR related to classical H3 antagonists, and are found to be in good agreement with the available experimental data, thus providing novel insights into the receptor-ligand interactions.
    Proteins Structure Function and Bioinformatics 08/2009; 78(2):457-73. · 3.39 Impact Factor
  • Article: Comparison of several molecular docking programs: pose prediction and virtual screening accuracy.
    [show abstract] [hide abstract]
    ABSTRACT: Molecular docking programs are widely used modeling tools for predicting ligand binding modes and structure based virtual screening. In this study, six molecular docking programs (DOCK, FlexX, GLIDE, ICM, PhDOCK, and Surflex) were evaluated using metrics intended to assess docking pose and virtual screening accuracy. Cognate ligand docking to 68 diverse, high-resolution X-ray complexes revealed that ICM, GLIDE, and Surflex generated ligand poses close to the X-ray conformation more often than the other docking programs. GLIDE and Surflex also outperformed the other docking programs when used for virtual screening, based on mean ROC AUC and ROC enrichment values obtained for the 40 protein targets in the Directory of Useful Decoys (DUD). Further analysis uncovered general trends in accuracy that are specific for particular protein families. Modifying basic parameters in the software was shown to have a significant effect on docking and virtual screening results, suggesting that expert knowledge is critical for optimizing the accuracy of these methods.
    Journal of Chemical Information and Modeling 07/2009; 49(6):1455-74. · 4.68 Impact Factor
  • Article: CONFIRM: connecting fragments found in receptor molecules.
    [show abstract] [hide abstract]
    ABSTRACT: A novel algorithm for the connecting of fragment molecules is presented and validated for a number of test systems. Within the CONFIRM (Connecting Fragments Found in Receptor Molecules) approach a pre-prepared library of bridges is searched to extract those which match a search criterion derived from known experimental or computational binding information about fragment molecules within a target binding site. The resulting bridge 'hits' are then connected, in an automated fashion, to the fragments and docked into the target receptor. Docking poses are assessed in terms of root-mean-squared deviation from the known positions of the fragment molecules, as well as docking score should known inhibitors be available. The creation of the bridge library, the full details and novelty of the CONFIRM algorithm, and the general applicability of this approach within the field of fragment-based de novo drug design are discussed.
    Journal of Computer-Aided Molecular Design 08/2008; 22(10):761-72. · 3.39 Impact Factor
  • Article: Investigation of MM-PBSA rescoring of docking poses.
    [show abstract] [hide abstract]
    ABSTRACT: Target-based virtual screening is increasingly used to generate leads for targets for which high quality three-dimensional (3D) structures are available. To allow large molecular databases to be screened rapidly, a tiered scoring scheme is often employed whereby a simple scoring function is used as a fast filter of the entire database and a more rigorous and time-consuming scoring function is used to rescore the top hits to produce the final list of ranked compounds. Molecular mechanics Poisson-Boltzmann surface area (MM-PBSA) approaches are currently thought to be quite effective at incorporating implicit solvation into the estimation of ligand binding free energies. In this paper, the ability of a high-throughput MM-PBSA rescoring function to discriminate between correct and incorrect docking poses is investigated in detail. Various initial scoring functions are used to generate docked poses for a subset of the CCDC/Astex test set and to dock one set of actives/inactives from the DUD data set. The effectiveness of each of these initial scoring functions is discussed. Overall, the ability of the MM-PBSA rescoring function to (i) regenerate the set of X-ray complexes when docking the bound conformation of the ligand, (ii) regenerate the X-ray complexes when docking conformationally expanded databases for each ligand which include "conformation decoys" of the ligand, and (iii) enrich known actives in a virtual screen for the mineralocorticoid receptor in the presence of "ligand decoys" is assessed. While a pharmacophore-based molecular docking approach, PhDock, is used to carry out the docking, the results are expected to be general to use with any docking method.
    Journal of Chemical Information and Modeling 06/2008; 48(5):1081-91. · 4.68 Impact Factor
  • Article: Lead optimization via high-throughput molecular docking.
    [show abstract] [hide abstract]
    ABSTRACT: Structure-based lead optimization approaches are increasingly playing a role in the drug-discovery process. Recent advances in 'high-throughput' molecular docking methods and examples of their successful use in lead optimization are reviewed. Measures of docking accuracy, scoring function comparisons, and consensus approaches are discussed. Differences in docking protocols typically used for lead optimization versus lead generation are highlighted; this section includes a discussion of the latest methods for the incorporation of protein flexibility. New approaches developed specifically for the design of combinatorial libraries as well as those designed or used for 'fragment' versus lead optimization are presented. Finally, potential future improvements to the technology are outlined.
    Current opinion in drug discovery & development 06/2007; 10(3):264-74. · 4.20 Impact Factor
  • Article: RNA unrestrained molecular dynamics ensemble improves agreement with experimental NMR data compared to single static structure: a test case.
    [show abstract] [hide abstract]
    ABSTRACT: Nuclear magnetic resonance (NMR) provides structural and dynamic information reflecting an average, often non-linear, of multiple solution-state conformations. Therefore, a single optimized structure derived from NMR refinement may be misleading if the NMR data actually result from averaging of distinct conformers. It is hypothesized that a conformational ensemble generated by a valid molecular dynamics (MD) simulation should be able to improve agreement with the NMR data set compared with the single optimized starting structure. Using a model system consisting of two sequence-related self-complementary ribonucleotide octamers for which NMR data was available, 0.3 ns particle mesh Ewald MD simulations were performed in the AMBER force field in the presence of explicit water and counterions. Agreement of the averaged properties of the molecular dynamics ensembles with NMR data such as homonuclear proton nuclear Overhauser effect (NOE)-based distance constraints, homonuclear proton and heteronuclear (1)H-(31)P coupling constant (J) data, and qualitative NMR information on hydrogen bond occupancy, was systematically assessed. Despite the short length of the simulation, the ensemble generated from it agreed with the NMR experimental constraints more completely than the single optimized NMR structure. This suggests that short unrestrained MD simulations may be of utility in interpreting NMR results. As expected, a 0.5 ns simulation utilizing a distance dependent dielectric did not improve agreement with the NMR data, consistent with its inferior exploration of conformational space as assessed by 2-D RMSD plots. Thus, ability to rapidly improve agreement with NMR constraints may be a sensitive diagnostic of the MD methods themselves.
    Journal of Computer-Aided Molecular Design 06/2006; 20(5):263-79. · 3.39 Impact Factor
  • Article: CLIP: Similarity Searching of 3D Databases Using Clique Detection.
    Journal of Chemical Information and Computer Sciences. 01/2003; 43:443-448.