Integrating diverse data for structure determination of macromolecular assemblies.

Department of Biopharmaceutical Sciences, and California Institute for Quantitative Biosciences, University of California at San Francisco, CA 94158-2330, USA.
Annual Review of Biochemistry (Impact Factor: 26.53). 08/2008; 77:443-77. DOI: 10.1146/annurev.biochem.77.060407.135530
Source: PubMed

ABSTRACT To understand the cell, we need to determine the macromolecular assembly structures, which may consist of tens to hundreds of components. First, we review the varied experimental data that characterize the assemblies at several levels of resolution. We then describe computational methods for generating the structures using these data. To maximize completeness, resolution, accuracy, precision, and efficiency of the structure determination, a computational approach is required that uses spatial information from a variety of experimental methods. We propose such an approach, defined by its three main components: a hierarchical representation of the assembly, a scoring function consisting of spatial restraints derived from experimental data, and an optimization method that generates structures consistent with the data. This approach is illustrated by determining the configuration of the 456 proteins in the nuclear pore complex (NPC) from baker's yeast. With these tools, we are poised to integrate structural information gathered at multiple levels of the biological hierarchy--from atoms to cells--into a common framework.

  • [Show abstract] [Hide abstract]
    ABSTRACT: Experimental structure determination continues to be challenging for membrane proteins. Computational prediction methods are therefore needed and widely used to supplement experimental data. Here, we re-examined the state of the art in transmembrane helix prediction based on a non-redundant dataset with 190 high-resolution structures. Analyzing 12 widely-used and well-known methods using a stringent performance measure, we largely confirmed the expected high level of performance. On the other hand, all methods performed worse for proteins that could not have been used for development. A few results stood out: Firstly, all methods predicted proteins in eukaryotes better than those in bacteria. Secondly, methods worked less well for proteins with many transmembrane helices. Thirdly, most methods correctly discriminated between soluble and transmembrane proteins. However, several older methods often mistook signal peptides for transmembrane helices. Some newer methods have overcome this shortcoming. In our hands, PolyPhobius and MEMSAT-SVM outperformed other methods. This article is protected by copyright. All rights reserved.
    Proteins Structure Function and Bioinformatics 12/2014; DOI:10.1002/prot.24749 · 3.34 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: This is an open access article under the CC BY-NC-ND license ( SUMMARY Describing, understanding, and modulating the func-tion of the cell require elucidation of the structures of macromolecular assemblies. Here, we describe an integrative method for modeling heteromeric com-plexes using as a starting point disassembly path-ways determined by native mass spectrometry (MS). In this method, the pathway data and other available information are encoded as a scoring func-tion on the positions of the subunits of the complex. The method was assessed on its ability to reproduce the native contacts in five benchmark cases with simulated MS data and two cases with real MS data. To illustrate the power of our method, we puri-fied the yeast initiation factor 3 (eIF3) complex and characterized it by native MS and chemical crosslink-ing MS. We established substoichiometric binding of eIF5 and derived a model for the five-subunit eIF3 complex, at domain level, consistent with its role as a scaffold for other initiation factors. INTRODUCTION
    Chemistry & Biology 01/2015; DOI:10.1016/j.chembiol.2014.11.010 · 6.59 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Data reporting on structure and dynamics of cellular constituents are growing with increasing pace enabling, as never before, the understanding of fine mechanistic aspects of biological systems and providing the possibility to affect them in controlled ways. Nonetheless, experimental techniques do not yet allow for an arbitrary level of resolution on cellular processes in situ. By consistently integrating a variety of diverse experimental data, molecular modeling is optimally poised to enhance to near-atomistic resolution our understanding of molecular recognition in large assemblies. Within this integrative modeling context, we briefly review in this chapter the recent progresses of molecular simulations at the atomistic and coarse-grained level of resolution to explore protein-protein interactions. In particular, we discuss our recent contributions in this field, which aim at providing a robust bridge between novel optimization algorithms and multiscale molecular simulations for a consistent integration of experimental inputs. We expect that, with the ever-growing sampling ability of molecular simulations and the tireless progress of experimental methods, the impact of such dynamic-based approach could only be more effective with time, contributing to provide detailed description of cellular organization. © 2014 Elsevier Inc. All rights reserved.
    Advances in Protein Chemistry and Structural Biology 01/2014; 96:77-111. DOI:10.1016/bs.apcsb.2014.06.008 · 3.74 Impact Factor

Full-text (3 Sources)

Available from
Jun 4, 2014