Article

Combining specificity determining and conserved residues improves functional site prediction.

EMBL Heidelberg, Heidelberg, Germany.
BMC Bioinformatics (impact factor: 2.75). 01/2009; 10. DOI:10.1186/1471-2105-10-174
Source: DBLP

ABSTRACT Predicting the location of functionally important sites from protein sequence and/or structure is a long-standing problem in computational biology. Most current approaches make use of sequence conservation, assuming that amino acid residues conserved within a protein family are most likely to be functionally important. Most often these approaches do not consider many residues that act to define specific sub-functions within a family, or they make no distinction between residues important for function and those more relevant for maintaining structure (e.g. in the hydrophobic core). Many protein families bind and/or act on a variety of ligands, meaning that conserved residues often only bind a common ligand sub-structure or perform general catalytic activities.
Here we present a novel method for functional site prediction based on identification of conserved positions, as well as those responsible for determining ligand specificity. We define Specificity-Determining Positions (SDPs), as those occupied by conserved residues within sub-groups of proteins in a family having a common specificity, but differ between groups, and are thus likely to account for specific recognition events. We benchmark the approach on enzyme families of known 3D structure with bound substrates, and find that in nearly all families residues predicted by SDPsite are in contact with the bound substrate, and that the addition of SDPs significantly improves functional site prediction accuracy. We apply SDPsite to various families of proteins containing known three-dimensional structures, but lacking clear functional annotations, and discusse several illustrative examples.
The results suggest a better means to predict functional details for the thousands of protein structures determined prior to a clear understanding of molecular function.

0 0
 · 
0 Bookmarks
 · 
45 Views
  • Source
    Article: New knowledge from old: in silico discovery of novel protein domains in Streptomyces coelicolor.
    [show abstract] [hide abstract]
    ABSTRACT: Streptomyces coelicolor has long been considered a remarkable bacterium with a complex life-cycle, ubiquitous environmental distribution, linear chromosomes and plasmids, and a huge range of pharmaceutically useful secondary metabolites. Completion of the genome sequence demonstrated that this diversity carried through to the genetic level, with over 7000 genes identified. We sought to expand our understanding of this organism at the molecular level through identification and annotation of novel protein domains. Protein domains are the evolutionary conserved units from which proteins are formed. Two automated methods were employed to rapidly generate an optimised set of targets, which were subsequently analysed manually. A final set of 37 domains or structural repeats, represented 204 times in the genome, was developed. Using these families enabled us to correlate items of information from many different resources. Several immediately enhance our understanding both of S. coelicolor and also general bacterial molecular mechanisms, including cell wall biosynthesis regulation and streptomycete telomere maintenance. Delineation of protein domain families enables detailed analysis of protein function, as well as identification of likely regions or residues of particular interest. Hence this kind of prior approach can increase the rate of discovery in the laboratory. Furthermore we demonstrate that using this type of in silico method it is possible to fairly rapidly generate new biological information from previously uncorrelated data.
    BMC Microbiology 03/2003; 3:3. · 3.04 Impact Factor
  • Source
    Article: Structure of YciI from Haemophilus influenzae (HI0828) reveals a ferredoxin-like alpha/beta-fold with a histidine/aspartate centered catalytic site.
    Proteins Structure Function and Bioinformatics 06/2005; 59(3):648-52. · 3.39 Impact Factor
  • Article: Comparative genomics of the vitamin B12 metabolism and regulation in prokaryotes.
    [show abstract] [hide abstract]
    ABSTRACT: Using comparative analysis of genes, operons, and regulatory elements, we describe the cobalamin (vitamin B12) biosynthetic pathway in available prokaryotic genomes. Here we found a highly conserved RNA secondary structure, the regulatory B12 element, which is widely distributed in the upstream regions of cobalamin biosynthetic/transport genes in eubacteria. In addition, the binding signal (CBL-box) for a hypothetical B12 regulator was identified in some archaea. A search for B12 elements and CBL-boxes and positional analysis identified a large number of new candidate B12-regulated genes in various prokaryotes. Among newly assigned functions associated with the cobalamin biosynthesis, there are several new types of cobalt transporters, ChlI and ChlD subunits of the CobN-dependent cobaltochelatase complex, cobalt reductase BluB, adenosyltransferase PduO, several new proteins linked to the lower ligand assembly pathway, l-threonine kinase PduX, and a large number of other hypothetical proteins. Most missing genes detected within the cobalamin biosynthetic pathways of various bacteria were identified as nonorthologous substitutes. The variable parts of the cobalamin metabolism appear to be the cobalt transport and insertion, the CobG/CbiG- and CobF/CbiD-catalyzed reactions, and the lower ligand synthesis pathway. The most interesting result of analysis of B12 elements is that B12-independent isozymes of the methionine synthase and ribonucleotide reductase are regulated by B12 elements in bacteria that have both B12-dependent and B12-independent isozymes. Moreover, B12 regulons of various bacteria are thought to include enzymes from known B12-dependent or alternative pathways.
    Journal of Biological Chemistry 11/2003; 278(42):41148-59. · 4.77 Impact Factor

Full-text (2 Sources)

View
6 Downloads
Available from
24 Jan 2013