Protein-protein docking benchmark version 3.0

Bioinformatics Program, Boston University, Boston, Massachusetts 02215, USA.
Proteins Structure Function and Bioinformatics (Impact Factor: 2.92). 11/2008; 73(3):705-9. DOI: 10.1002/prot.22106
Source: PubMed

ABSTRACT We present version 3.0 of our publicly available protein-protein docking benchmark. This update includes 40 new test cases, representing a 48% increase from Benchmark 2.0. For all of the new cases, the crystal structures of both binding partners are available. As with Benchmark 2.0, Structural Classification of Proteins (Murzin et al., J Mol Biol 1995;247:536-540) was used to remove redundant test cases. The 124 unbound-unbound test cases in Benchmark 3.0 are classified into 88 rigid-body cases, 19 medium-difficulty cases, and 17 difficult cases, based on the degree of conformational change at the interface upon complex formation. In addition to providing the community with more test cases for evaluating docking methods, the expansion of Benchmark 3.0 will facilitate the development of new algorithms that require a large number of training examples. Benchmark 3.0 is available to the public at

Download full-text


Available from: Joël Janin, Mar 13, 2015
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Arabidopsis AtPRMT10 is a plant-specific type I protein arginine methyltransferase that can asymmetrically dimethylate arginine 3 of histone H4 with auto-methylation activity. Mutations of AtPRMT10 derepress FLOWERING LOCUS C (FLC) expression resulting in a late-flowering phenotype. Here, to further investigate the biochemical characteristics of AtPRMT10, we analyzed a series of mutated forms of the AtPRMT10 protein. We demonstrate that the conserved "VLD" residues and "double-E loop" are essential for enzymatic activity of AtPRMT10. In addition, we show that Arg54 and Cys259 of AtPRMT10, two residues unreported in animals, are also important for its enzymatic activity. We find that Arg13 of AtPRMT10 is the auto-methylation site. However, substitution of Arg13 to Lys13 does not affect its enzymatic activity. In vivo complementation assays reveal that plants expressing AtPRMT10 with VLD-AAA, E143Q or E152Q mutations retain high levels of FLC expression and fail to rescue the late-flowering phenotype of atprmt10 plants. Taken together, we conclude that the methyltransferase activity of AtPRMT10 is essential for repressing FLC expression and promoting flowering in Arabidopsis.
    Protein & Cell 06/2012; 3(6):450-9. DOI:10.1007/s13238-012-2935-3 · 2.85 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Structural knowledge about protein-protein interactions can provide insights to the basic processes underlying cell function. Recent progress in experimental and computational structural biology has led to a rapid growth of experimentally resolved structures and computationally determined near-native models of protein-protein interactions. However, determining whether a protein-protein interaction is physiological or it is the artifact of an experimental or computational method remains a challenging problem. In this work, we have addressed two related problems. The first problem is distinguishing between the experimentally obtained physiological and crystal-packing protein-protein interactions. The second problem is concerned with the classification of near-native and inaccurate docking models. We first defined a universal set of interface features and employed a support vector machines (SVM)-based approach to classify the interactions for both problems, with the accuracy, precision, and recall for the first problem classifier reaching 93%. To improve the classification, we next developed a semi-supervised learning approach for the second problem, using transductive SVM (TSVM). We applied both classifiers to a commonly used protein docking benchmark of 124 complexes. We found that while we reached the classification accuracies of 78.9% for the SVM classifier and 80.3% for the TSVM classifier, improving protein-docking methods by model re-ranking remains a challenging problem.
    Proteomics 11/2011; 11(22):4321-30. DOI:10.1002/pmic.201100217 · 3.97 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Our ability to infer the protein quaternary structure automatically from atom and lattice information is inadequate, especially for weak complexes, and heteromeric quaternary structures. Several approaches exist, but they have limited performance. Here, we present a new scheme to infer protein quaternary structure from lattice and protein information, with all-around coverage for strong, weak and very weak affinity homomeric and heteromeric complexes. The scheme combines naive Bayes classifier and point group symmetry under Boolean framework to detect quaternary structures in crystal lattice. It consistently produces ≥90% coverage across diverse benchmarking data sets, including a notably superior 95% coverage for recognition heteromeric complexes, compared with 53% on the same data set by current state-of-the-art method. The detailed study of a limited number of prediction-failed cases offers interesting insights into the intriguing nature of protein contacts in lattice. The findings have implications for accurate inference of quaternary states of proteins, especially weak affinity complexes.
    Structure 03/2011; 19(3):304-12. DOI:10.1016/j.str.2011.01.009 · 6.79 Impact Factor