Integrating diverse data for structure determination of macromolecular assemblies.

Department of Biopharmaceutical Sciences, and California Institute for Quantitative Biosciences, University of California at San Francisco, CA 94158-2330, USA.
Annual Review of Biochemistry (Impact Factor: 26.53). 08/2008; 77:443-77. DOI: 10.1146/annurev.biochem.77.060407.135530
Source: PubMed

ABSTRACT To understand the cell, we need to determine the macromolecular assembly structures, which may consist of tens to hundreds of components. First, we review the varied experimental data that characterize the assemblies at several levels of resolution. We then describe computational methods for generating the structures using these data. To maximize completeness, resolution, accuracy, precision, and efficiency of the structure determination, a computational approach is required that uses spatial information from a variety of experimental methods. We propose such an approach, defined by its three main components: a hierarchical representation of the assembly, a scoring function consisting of spatial restraints derived from experimental data, and an optimization method that generates structures consistent with the data. This approach is illustrated by determining the configuration of the 456 proteins in the nuclear pore complex (NPC) from baker's yeast. With these tools, we are poised to integrate structural information gathered at multiple levels of the biological hierarchy--from atoms to cells--into a common framework.

  • [Show abstract] [Hide abstract]
    ABSTRACT: Experimental structure determination continues to be challenging for membrane proteins. Computational prediction methods are therefore needed and widely used to supplement experimental data. Here, we re-examined the state of the art in transmembrane helix prediction based on a non-redundant dataset with 190 high-resolution structures. Analyzing 12 widely-used and well-known methods using a stringent performance measure, we largely confirmed the expected high level of performance. On the other hand, all methods performed worse for proteins that could not have been used for development. A few results stood out: Firstly, all methods predicted proteins in eukaryotes better than those in bacteria. Secondly, methods worked less well for proteins with many transmembrane helices. Thirdly, most methods correctly discriminated between soluble and transmembrane proteins. However, several older methods often mistook signal peptides for transmembrane helices. Some newer methods have overcome this shortcoming. In our hands, PolyPhobius and MEMSAT-SVM outperformed other methods. This article is protected by copyright. All rights reserved.
    Proteins Structure Function and Bioinformatics 12/2014; 83(3). DOI:10.1002/prot.24749 · 2.92 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Data reporting on structure and dynamics of cellular constituents are growing with increasing pace enabling, as never before, the understanding of fine mechanistic aspects of biological systems and providing the possibility to affect them in controlled ways. Nonetheless, experimental techniques do not yet allow for an arbitrary level of resolution on cellular processes in situ. By consistently integrating a variety of diverse experimental data, molecular modeling is optimally poised to enhance to near-atomistic resolution our understanding of molecular recognition in large assemblies. Within this integrative modeling context, we briefly review in this chapter the recent progresses of molecular simulations at the atomistic and coarse-grained level of resolution to explore protein-protein interactions. In particular, we discuss our recent contributions in this field, which aim at providing a robust bridge between novel optimization algorithms and multiscale molecular simulations for a consistent integration of experimental inputs. We expect that, with the ever-growing sampling ability of molecular simulations and the tireless progress of experimental methods, the impact of such dynamic-based approach could only be more effective with time, contributing to provide detailed description of cellular organization. © 2014 Elsevier Inc. All rights reserved.
    Advances in Protein Chemistry and Structural Biology 01/2014; 96:77-111. DOI:10.1016/bs.apcsb.2014.06.008 · 3.74 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The nuclear pore complex (NPC) is one of the largest supramolecular structures in eukaryotic cells. Its octagonal ring-scaffold perforates the nuclear envelope and features a unique molecular machinery that regulates nucleocytoplasmic transport. NPCs are composed of ~30 different nucleoporins (Nups), averaged at 8, 16 or 32 copies per NPC. This estimate has not been confirmed for individual NPCs in living cells due to the inherent difficulty of counting proteins inside single supramolecular complexes. Here we used single-molecule SPEED microscopy to directly count the copy-number of twenty-four different Nups within individual NPCs of live yeast, and found agreement as well as significant deviation from previous estimates. As expected, we counted 8 copies of four peripheral Nups and 16 copies of fourteen scaffold Nups. Unexpectedly, we counted a maximum of 16 copies of Nsp1 and Nic96, rather than 32 as previously estimated; and found only 10–15 copies of six other Nups, rather than 8 or 16 copies as expected. This in situ molecular-counting technology can test structure-function models of NPCs and other supramolecular structures in cells.
    Scientific Reports 03/2015; 5. DOI:10.1038/srep09372 · 5.08 Impact Factor

Full-text (3 Sources)

Available from
Jun 4, 2014