Improved Modeling of Side-Chain-Base Interactions and Plasticity in Protein-DNA Interface Design

Department of Biochemistry, University of Washington, Seattle, WA 98195, USA.
Journal of Molecular Biology (Impact Factor: 3.96). 03/2012; 419(3-4):255-74. DOI: 10.1016/j.jmb.2012.03.005
Source: PubMed

ABSTRACT Combinatorial sequence optimization for protein design requires libraries of discrete side-chain conformations. The discreteness of these libraries is problematic, particularly for long, polar side chains, since favorable interactions can be missed. Previously, an approach to loop remodeling where protein backbone movement is directed by side-chain rotamers predicted to form interactions previously observed in native complexes (termed "motifs") was described. Here, we show how such motif libraries can be incorporated into combinatorial sequence optimization protocols and improve native complex recapitulation. Guided by the motif rotamer searches, we made improvements to the underlying energy function, increasing recapitulation of native interactions. To further test the methods, we carried out a comprehensive experimental scan of amino acid preferences in the I-AniI protein-DNA interface and found that many positions tolerated multiple amino acids. This sequence plasticity is not observed in the computational results because of the fixed-backbone approximation of the model. We improved modeling of this diversity by introducing DNA flexibility and reducing the convergence of the simulated annealing algorithm that drives the design process. In addition to serving as a benchmark, this extensive experimental data set provides insight into the types of interactions essential to maintain the function of this potential gene therapy reagent.

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: We describe the identification and characterization of novel homing endonucleases using genome database mining to identify putative target sites, followed by high throughput activity screening in a bacterial selection system. We characterized the substrate specificity and kinetics of these endonucleases by monitoring DNA cleavage events with deep sequencing. The endonuclease specificities revealed by these experiments can be partially recapitulated using 3D structure-based computational models. Analysis of these models together with genome sequence data provide insights into how alternative endonuclease specificities were generated during natural evolution.
    Nucleic Acids Research 11/2014; 42(22). DOI:10.1093/nar/gku1096 · 8.81 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Protein:DNA interactions are essential to a range of processes that maintain and express the information encoded in the genome. Structural modeling is an approach that aims to understand these interactions at the physicochemical level. It has been proposed that structural modeling can lead to deeper understanding of the mechanisms of protein:DNA interactions, and that progress in this field can not only help to rationalize the observed specificities of DNA-binding proteins but also to allow researchers to engineer novel DNA site specificities. In this review we discuss recent developments in the structural description of protein:DNA interactions and specificity, as well as the challenges facing the field in the future. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email:
    Briefings in functional genomics 11/2014; 14(1). DOI:10.1093/bfgp/elu044 · 3.43 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Computational design is becoming an integral component in developing novel enzymatic activities. Catalytic efficiencies of man-made enzymes however are far behind their natural counterparts. The discrepancy between laboratory and naturally evolved enzymes suggests that a major catalytic factor is still missing in the computational process. Reorganization energy, which is the origin of catalytic power of natural enzymes, has not been exploited yet for design. As exemplified in case of KE07 Kemp eliminase, this quantity is optimized by directed evolution. Mutations beneficial for evolution, but without direct impact on catalysis can be identified based on contributions to reorganization energy. We propose to incorporate the reorganization energy in scaffold selection to provide highly evolvable initial designs.
    Current opinion in chemical biology 04/2014; 21C:34-41. DOI:10.1016/j.cbpa.2014.03.011 · 7.65 Impact Factor


Available from