Protein Structure Similarity Clustering: Dynamic Treatment of PDB Structures Facilitates Clustering

Department of Chemistry, University of Nebraska at Lincoln, Lincoln, Nebraska, United States
Angewandte Chemie International Edition (Impact Factor: 11.26). 11/2006; 45(46):7766-70. DOI: 10.1002/anie.200602125
Source: PubMed


Protein structure similarity clustering (PSSC) [1]– [3] is one of a number of potential guiding principles [4], [5] that have been introduced to focus combinatorial-library design/ protein targeting. PSSC clusters protein targets with similar ligand-binding cores in which little sequence or functional similarity is evident. Lead compounds for one member of the cluster then provide novel starting points in chemical space for ligand development for other members of the PSSC. We describe herein a new clustering procedure that lends itself to ligand docking, molecular dynamics (MD), and the vector-alignment-search-tool (VAST) [6] algorithm. This MD-assisted approach offers an alternative to the serial structure retrieval/ inspection steps used to generate the original protein structure similarity cluster, centered on Cdc25A. [2] Furthermore, in a particularly difficult de novo test, the mannose 6- phosphate/insulin-like growth factor II receptor (M6P/IGF2R), lends itself to clustering, with MD opening up connections to partners that the static crystal structure coordinates fail to find. This version includes 15 pages of Supporting Information not published with the printed edition of the article in Angewandte Chemie.

Download full-text


Available from: David B Berkowitz
  • Source
    • "Protein sequence clustering methods can be classified according to two aspects: the clustering algorithm type used, which is usually either hierarchical or graph-based; and the criterion used to group sequences, which is usually either domain-based or family-based. On structural genomics studies, Charette et al. (2006) used protein clustering for protein ligand-docking and molecular dynamics, whereas Shen et al. (2005) used it for protein class prediction, and Yan and Moult (2005) searched for representative family structural templates. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Motivation: Protein sequence data is growing at an expo- nential rate. However a considerable portion of this data is redundant, with many new sequences being very similar to others in the databases. While clustering has been used to reduce this redundancy, the influence of sequence similarity in the functional quality of the clusters is still unclear. Results: In this work, we introduce a greedy graph-based clustering algorithm, which is tested using the Swiss-Prot database. We study the topology of the protein space as function of the threshold BLAST e-values, and the functional characterization of the clusters using the Gene Ontology. Initial results show that seemingly the cluster centers alone can capture a large portion of the information content of the database, therefore largely reducing its redundancy. Also it was found an expected increase of cluster functional coher- ence and characterization with the stringency of the thresh- old, as well as the amount of information captured by the cluster centers.
    Full-text · Article ·
  • [Show abstract] [Hide abstract]
    ABSTRACT: Among the strategies that can lead to the discovery of new drugs, the identification and use of privileged structures, molecular fragments that are able to interact with more than one target, gained particular attention, in an attempt to find new drugs in a shorter time with respect to other strategies. These structures, that have been identified mainly by empirical observations, can target only a given protein family, or can be able to interact with more, unrelated targets. This review deals with structures not covered in recent papers on this topic, and emphasizes the importance of understanding the structure-target relationships, that confer the privileged status.
    No preview · Article · Feb 2006 · Current Medicinal Chemistry
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The underlying frameworks of natural product classes with multiple biological activities can be regarded as biologically selected and prevalidated starting points in vast chemical structure space in the development of compound collections for chemical biology and medicinal chemistry research. For the synthesis of natural product-derived and -inspired compound collections, the development of enantioselective transformations in a format amenable to library synthesis, e.g., on the solid support, is a major and largely unexplored goal. We report on the enantioselective solid-phase synthesis of a natural product-inspired alpha,beta-unsaturated delta-lactone collection and its investigation in cell-based screens monitoring cell cycle progression and viral entry into cells. The screens identified modulators of both biological processes at a high hit rate. The screen for inhibition of viral entry opens up avenues of research for the identification of compounds with antiviral activity.
    Full-text · Article · May 2007 · Chemistry & Biology
Show more