Mathias Wawer

Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, North Rhine-Westphalia, Germany

Are you Mathias Wawer?

Claim your profile

Publications (14)59.17 Total impact

  • Article: Local structural changes, global data views: graphical substructure-activity relationship trailing.
    Mathias Wawer, Jürgen Bajorath
    [show abstract] [hide abstract]
    ABSTRACT: The systematic extraction of structure-activity relationship (SAR) information from large and diverse compound data sets depends on the application of computational analysis methods. Irrespective of the methodological details, the ultimate goal of large-scale SAR analysis is to identify most informative compounds and rationalize structural changes that determine SAR behavior. Such insights provide a basis for further chemical exploration. Herein we introduce the first graphical SAR analysis method that globally organizes large compound data sets on the basis of local structural relationships, hence providing an immediate access to important structural modifications and SAR determinants.
    Journal of Medicinal Chemistry 03/2011; 54(8):2944-51. · 4.80 Impact Factor
  • Article: Rationalizing the role of SAR tolerance for ligand-based virtual screening.
    [show abstract] [hide abstract]
    ABSTRACT: It is well appreciated that the results of ligand-based virtual screening (LBVS) are much influenced by methodological details, given the generally strong compound class dependence of LBVS methods. It is less well understood to what extent structure-activity relationship (SAR) characteristics might influence the outcome of LBVS. We have assessed the hypothesis that the success of prospective LBVS depends on the SAR tolerance of screening targets, in addition to methodological aspects. In this context, SAR tolerance is rationalized as the ability of a target protein to specifically interact with series of structurally diverse active compounds. In compound data sets, SAR tolerance articulates itself as SAR continuity, i.e., the presence of structurally diverse compounds having similar potency. In order to analyze the role of SAR tolerance for LBVS, activity landscape representations of compounds active against 16 different target proteins were generated for which successful LBVS applications were reported. In all instances, the activity landscapes of known active compounds contained multiple regions of local SAR continuity. When analyzing the location of newly identified LBVS hits and their SAR environments, we found that these hits almost exclusively mapped to regions of distinct local SAR continuity. Taken together, these findings indicate the presence of a close link between SAR tolerance at the target level, SAR continuity at the ligand level, and the probability of LBVS success.
    Journal of Chemical Information and Modeling 03/2011; 51(4):837-42. · 4.68 Impact Factor
  • Article: Extracting SAR Information from a Large Collection of Anti-Malarial Screening Hits by NSG-SPT Analysis
    Mathias Wawer, Jürgen Bajorath
    [show abstract] [hide abstract]
    ABSTRACT: We combine two graphical SAR analysis methods, Network-like Similarity Graphs (NSGs) and Similarity-Potency Trees (SPTs), to search for SAR information in a large and heterogeneous compound data set containing more than 13,000 antimalarial screening hits that was recently released by GlaxoSmithKline (GSK). The NSG-SPT approach first identifies subsets of compounds inducing local SAR discontinuity in data sets and then extracts available SAR information from these subsets in a graphically intuitive manner. Applying the NSG-SPT analysis scheme, we have identified in the GSK collection compound subsets of high local SAR information content including both known and previously unknown antimalarial chemotypes, which yielded interpretable SAR patterns. This information should be helpful to prioritize and select antimalarial candidate compounds for further chemical exploration. Furthermore, the NSG-SPT tools are publicly available, and our study also shows how to practically apply these SAR analysis methods to study large compound data sets.Keywords (keywords): Anti-malaria screening hits; data mining; structure−activity relationship (SAR) information; graphical SAR analysis; network-like similarity graphs; similarity-potency trees
    01/2011;
  • Article: Design of multitarget activity landscapes that capture hierarchical activity cliff distributions.
    [show abstract] [hide abstract]
    ABSTRACT: An activity landscape model of a compound data set can be rationalized as a graphical representation that integrates molecular similarity and potency relationships. Activity landscape representations of different design are utilized to aid in the analysis of structure-activity relationships and the selection of informative compounds. Activity landscape models reported thus far focus on a single target (i.e., a single biological activity) or at most two targets, giving rise to selectivity landscapes. For compounds active against more than two targets, landscapes representing multitarget activities are difficult to conceptualize and have not yet been reported. Herein, we present a first activity landscape design that integrates compound potency relationships across multiple targets in a formally consistent manner. These multitarget activity landscapes are based on a general activity cliff classification scheme and are visualized in graph representations, where activity cliffs are represented as edges. Furthermore, the contributions of individual compounds to structure-activity relationship discontinuity across multiple targets are monitored. The methodology has been applied to derive multitarget activity landscapes for compound data sets active against different target families. The resulting landscapes identify single-, dual-, and triple-target activity cliffs and reveal the presence of hierarchical cliff distributions. From these multitarget activity landscapes, compounds forming complex activity cliffs can be readily selected.
    Journal of Chemical Information and Modeling 01/2011; 51(2):258-66. · 4.68 Impact Factor
  • Article: Activity Landscape Representations for Structure-Activity Relationship Analysis.
    Journal of Medicinal Chemistry 09/2010; · 4.80 Impact Factor
  • Article: Similarity-potency trees: a method to search for SAR information in compound data sets and derive SAR rules.
    Mathias Wawer, Jürgen Bajorath
    [show abstract] [hide abstract]
    ABSTRACT: An intuitive and generally applicable analysis method, termed similarity-potency tree (SPT), is introduced to mine structure-activity relationship (SAR) information in compound data sets of any source. Only compound potency values and nearest-neighbor similarity relationships are considered. Rather than analyzing a data set as a whole, in part overlapping compound neighborhoods are systematically generated and represented as SPTs. This local analysis scheme simplifies the evaluation of SAR information and SPTs of high SAR information content are easily identified. By inspecting only a limited number of compound neighborhoods, it is also straightforward to determine whether data sets contain only little or no interpretable SAR information. Interactive analysis of SPTs is facilitated by reading the trees in two directions, which makes it possible to extract SAR rules, if available, in a consistent manner. The simplicity and interpretability of the data structure and the ease of calculation are characteristic features of this approach. We apply the methodology to high-throughput screening and lead optimization data sets, compare the approach to standard clustering techniques, illustrate how SAR rules are derived, and provide some practical guidance how to best utilize the methodology. The SPT program is made freely available to the scientific community.
    Journal of Chemical Information and Modeling 08/2010; 50(8):1395-409. · 4.68 Impact Factor
  • Source
    Article: Data structures and computational tools for the extraction of SAR information from large compound sets.
    [show abstract] [hide abstract]
    ABSTRACT: Computational data mining and visualization techniques play a central part in the extraction of structure-activity relationship (SAR) information from compound sets including high-throughput screening data. Standard statistical and classification techniques can be used to organize data sets and evaluate the chemical neighborhood of potent hits; however, such methods are limited in their ability to extract complex SAR patterns from data sets and make them readily accessible to medicinal chemists. Therefore, new approaches and data structures are being developed that explicitly focus on molecular structure and its relationship to biological activity across multiple targets. Here, we review standard techniques for compound data analysis and describe new data structures and computational tools for SAR mining of large compound data sets.
    Drug discovery today 08/2010; 15(15-16):630-9. · 6.63 Impact Factor
  • Article: SARANEA: a freely available program to mine structure-activity and structure-selectivity relationship information in compound data sets.
    [show abstract] [hide abstract]
    ABSTRACT: We introduce SARANEA, an open-source Java application for interactive exploration of structure-activity relationship (SAR) and structure-selectivity relationship (SSR) information in compound sets of any source. SARANEA integrates various SAR and SSR analysis functions and utilizes a network-like similarity graph data structure for visualization. The program enables the systematic detection of activity and selectivity cliffs and corresponding key compounds across multiple targets. Advanced SAR analysis functions implemented in SARANEA include, among others, layered chemical neighborhood graphs, cliff indices, selectivity trees, editing functions for molecular networks and pathways, bioactivity summaries of key compounds, and markers for bioactive compounds having potential side effects. We report the application of SARANEA to identify SAR and SSR determinants in different sets of serine protease inhibitors. It is found that key compounds can influence SARs and SSRs in rather different ways. Such compounds and their SAR/SSR characteristics can be systematically identified and explored using SARANEA. The program and source code are made freely available under the GNU General Public License.
    Journal of Chemical Information and Modeling 01/2010; 50(1):68-78. · 4.68 Impact Factor
  • Article: Extraction of structure-activity relationship information from high-throughput screening data.
    Mathias Wawer, Jürgen Bajorath
    [show abstract] [hide abstract]
    ABSTRACT: The wealth of biological screening data that is generated poses substantial problems to medicinal chemistry. A key question becomes how to best prioritize and select hits for further evaluation from the many weakly active compounds that are typically identified in HTS campaigns. Such decisions can be substantially supported if it is possible to evaluate preliminary structure-activity relationship (SAR) information that might be contained in screening data. If SAR information can be extracted from screening data, one can attempt to estimate the chemical optimization potential of hits. We will discuss different types of approaches that have been developed to facilitate HTS data analysis, with special emphasis on recent methods to explore SAR information contained in screening sets.
    Current Medicinal Chemistry 09/2009; 16(31):4049-57. · 4.86 Impact Factor
  • Article: Systematic extraction of structure-activity relationship information from biological screening data.
    Mathias Wawer, Jürgen Bajorath
    [show abstract] [hide abstract]
    ABSTRACT: A data mining approach is introduced that automatically extracts SAR information from high-throughput screening data sets and that helps to select active compounds for chemical exploration and hit-to-lead projects. SAR pathways are systematically identified consisting of sequences of similar active compounds with gradual increases in potency. Fully enumerated SAR pathway sets are subjected to pathway scoring, filtering, and mining, and pathways with the most significant SAR information content are prioritized. High-scoring SAR pathways often reveal activity cliffs contained in screening data. Subsets of SAR pathways are analyzed in SAR trees that make it possible to identify microenvironments of significant SAR discontinuity from which hits are preferentially selected. SAR trees of alternative pathways leading to activity cliffs identify key compounds and help to develop chemically intuitive SAR hypotheses.
    ChemMedChem 08/2009; 4(9):1431-8. · 3.15 Impact Factor
  • Article: Navigating structure-activity landscapes.
    [show abstract] [hide abstract]
    ABSTRACT: The problem of how to explore structure-activity relationships (SARs) systematically is still largely unsolved in medicinal chemistry. Recently, data analysis tools have been introduced to navigate activity landscapes and to assess SARs on a large scale. Initial investigations reveal a surprising heterogeneity among SARs and shed light on the relationship between 'global' and 'local' SAR features. Moreover, insights are provided into the fundamental issue of why modeling tools work well in some cases, but not in others.
    Drug discovery today 06/2009; 14(13-14):698-705. · 6.63 Impact Factor
  • Article: Elucidation of structure-activity relationship pathways in biological screening data.
    [show abstract] [hide abstract]
    ABSTRACT: A computational molecular network analysis of various high-throughput screening (HTS) data sets including inhibition assays and cell-based screens organizes screening hits according to different local structure-activity relationships (SARs). The resulting network representations make it possible to focus on different local SAR environments in screening data. We have designed a simple scoring function accounting for similarity and potency relationships among hits that identifies SAR pathways leading from active compounds in different SAR contexts to key compounds forming activity cliffs. From these pathways, SAR information can be extracted and utilized to select hits for further analysis. In clusters of hits related by different local SARs, alternative pathways can be systematically explored and ranked according to SAR information content, which makes it possible to prioritize hits in a consistent manner.
    Journal of Medicinal Chemistry 02/2009; 52(4):1075-80. · 4.80 Impact Factor
  • Article: Structure-activity relationship anatomy by network-like similarity graphs and local structure-activity relationship indices.
    [show abstract] [hide abstract]
    ABSTRACT: The study of structure-activity relationships (SARs) of small molecules is of fundamental importance in medicinal chemistry and drug design. Here, we introduce an approach that combines the analysis of similarity-based molecular networks and SAR index distributions to identify multiple SAR components present within sets of active compounds. Different compound classes produce molecular networks of distinct topology. Subsets of compounds related by different local SARs are often organized in small communities in networks annotated with potency information. Many local SAR communities are not isolated but connected by chemical bridges, i.e., similar molecules occurring in different local SAR contexts. The analysis makes it possible to relate local and global SAR features to each other and identify key compounds that are major determinants of SAR characteristics. In many instances, such compounds represent start and end points of chemical optimization pathways and aid in the selection of other candidates from their communities.
    Journal of Medicinal Chemistry 10/2008; 51(19):6075-84. · 4.80 Impact Factor
  • Article: Systematic extraction of structure-activity relationship information from biological screening data
    Mathias Wawer, L Peltason, Jürgen Bajorath