Prediction of metal ion-binding sites in proteins using the fragment transformation method.

Graduate Institute of Molecular Systems Biomedicine, China Medical University, Taichung, Taiwan.
PLoS ONE (Impact Factor: 3.53). 06/2012; 7(6):e39252. DOI: 10.1371/journal.pone.0039252
Source: PubMed

ABSTRACT The structure of a protein determines its function and its interactions with other factors. Regions of proteins that interact with ligands, substrates, and/or other proteins, tend to be conserved both in sequence and structure, and the residues involved are usually in close spatial proximity. More than 70,000 protein structures are currently found in the Protein Data Bank, and approximately one-third contain metal ions essential for function. Identifying and characterizing metal ion-binding sites experimentally is time-consuming and costly. Many computational methods have been developed to identify metal ion-binding sites, and most use only sequence information. For the work reported herein, we developed a method that uses sequence and structural information to predict the residues in metal ion-binding sites. Six types of metal ion-binding templates- those involving Ca(2+), Cu(2+), Fe(3+), Mg(2+), Mn(2+), and Zn(2+)-were constructed using the residues within 3.5 Å of the center of the metal ion. Using the fragment transformation method, we then compared known metal ion-binding sites with the templates to assess the accuracy of our method. Our method achieved an overall 94.6 % accuracy with a true positive rate of 60.5 % at a 5 % false positive rate and therefore constitutes a significant improvement in metal-binding site prediction.

1 Bookmark
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Human schistosomiasis is a freshwater snail-transmitted disease caused by parasitic flatworms of the Schistosoma genus. Schistosoma haematobium, Schistosoma mansoni, and Schistosoma japonicum are the three major species infecting humans. These parasites undergo a complex developmental life cycle, in which they encounter a plethora of environmental signals. The presence of genes encoding the universal stress protein (USP) domain in the genomes of Schistosoma spp. suggests these flatworms are equipped to respond to unfavorable conditions. Though data on gene expression is available for USP genes, their biochemical and environmental regulation are incompletely understood. The identification of additional regulatory molecules for Schistosoma. USPs, which may be present in the human, snail, or water environments, could also be useful for schistosomiasis interventions. We developed a protocol that includes a visual analytics stage to facilitate integration, visualization, and decision making, from the results of sequence analyses and data collection on a set of 13 USPs from S. mansoni and S. japonicum. Multiple sequence alignment identified conserved sites that could be key residues regulating the function of USPs of the Schistosoma spp. Based on the consistency and completeness of sequence annotation, we prioritized for further research the gene for a 184-amino-acid-long USP that is present in the genomes of the three human-infecting Schistosoma spp. Calcium, zinc, and magnesium ions were predicted to interact with the protein product of the gene. Given that the initial effects of praziquantel on schistosomes include the influx of calcium ions, additional investigations are required to (1) functionally characterize the interactions of calcium ions with the amino acid residues of Schistosoma USPs; and (2) determine the transcriptional response of Schistosoma. USP genes to praziquantel. The data sets produced, and the visual analytics views that were developed, can be easily reused to develop new hypotheses.
    Advances and Applications in Bioinformatics and Chemistry 01/2013; 6:15-27.
  • [Show abstract] [Hide abstract]
    ABSTRACT: The survival of Mycobacterium tuberculosis depends on mycolic acids - very long α-alkyl-β-hydroxy fatty acids comprising 60 to 90 carbon atoms. However, despite considerable efforts, little is known about how enzymes involved in mycolic acid biosynthesis recognize and bind their hydrophobic fatty acyl substrates. The condensing enzyme KasA is pivotal for the synthesis of very long (C38-42) fatty acids, the precursors of mycolic acids. To probe the mechanism of substrate and inhibitor recognition by KasA, we determined the structure of this protein in complex with a mycobacterial phospholipid, and with several thiolactomycin derivatives that were designed as substrate analogs. Our structures provide consecutive snapshots along the reaction coordinate for the enzyme-catalyzed reaction, and support an induced-fit mechanism in which a wide cavity is established through the concerted opening of three gatekeeping residues and several α-helices. The stepwise characterization of the binding process provides mechanistic insights into the induced-fit recognition in this system and serves as an excellent foundation for the development of high affinity KasA inhibitors.
    Journal of Biological Chemistry 10/2013; · 4.60 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Zinc is one of the most essential metals utilized by organisms, and zinc-binding proteins play an important role in a variety of biological processes such as transcription regulation, cell metabolism and apoptosis. Thus, characterizing the precise zinc-binding sites is fundamental to an elucidation of the biological functions and molecular mechanisms of zinc-binding proteins. Using systematic analyses of structural characteristics, we observed that 4-residue and 3-residue zinc-binding sites have distinctly specific geometric features. Based on the results, we developed the novel computational program Geometric REstriction for Zinc-binding (GRE4Zn) to characterize the zinc-binding sites in protein structures, by restricting the distances between zinc and its coordinating atoms. The comparison between GRE4Zn and analogous tools revealed that it achieved a superior performance. A large-scale prediction for structurally characterized proteins was performed with this powerful predictor, and statistical analyses for the results indicated zinc-binding proteins have come to be significantly involved in more complicated biological processes in higher species than simpler species during the course of evolution. Further analyses suggested that zinc-binding proteins are preferentially implicated in a variety of diseases and highly enriched in known drug targets, and the prediction of zinc-binding sites can be helpful for the investigation of molecular mechanisms. In this regard, these prediction and analysis results should prove to be highly useful be helpful for further biomedical study and drug design. The online service of GRE4Zn is freely available at:
    Biochimica et Biophysica Acta 03/2013; · 4.66 Impact Factor

Full-text (2 Sources)

Available from
May 29, 2014