[show abstract][hide abstract] ABSTRACT: A problem often encountered in multidimensional NMR-spectroscopy is that an existing chemical shift list of a protein has
to be used to assign an experimental spectrum but does not fit sufficiently well for a safe assignment. A similar problem
occurs when temperature or pressure series of n-dimensional spectra are to be evaluated automatically. We have developed two different algorithms, AUREMOL-SHIFTOPT1 and
AUREMOL-SHIFTOPT2 that fulfill this task. In the present contribution their performance is analyzed employing a set of simulated
and experimental two-dimensional and three-dimensional spectra obtained from three different proteins. A new z-score based on atom and amino acid specific chemical shift distributions is introduced to weight the chemical shift contributions
in different dimensions properly.
Journal of Biomolecular NMR 04/2012; 43(4):197-210. · 2.85 Impact Factor
[show abstract][hide abstract] ABSTRACT: We present here the computer program AUREMOL-RFAC-3D that is a generalization of the previously published program RFAC for the fully automated estimation of residual indices (R-factors) from 2D NOESY spectra. It is part of the larger AUREMOL software package (www.auremol.de). RFAC-3D calculates R-factors directly from two-dimensional homonuclear NOESY spectra as well as from three-dimensional (15)N or (13)C edited NOESY-HSQC spectra and thus extends the application range to larger proteins. The fully automated method includes automated peak picking and integration, a Bayesian noise and artifact recognition and the use of the complete relaxation matrix formalism. To enhance the reliability of the calculated R-factors the method is also generalized to calculate combined R-factors from a set of 2D and 3D-spectra. For an optimal combination of the information derived from different sources a plausible formalism had to be derived. In addition, we present a novel direct R-factors based measure that correlates an R-factors as defined in this paper to the root mean square deviation of the actual structure from the optimal structure. The new program has been successfully tested on the histidine containing phosphocarrier protein (HPr) from Staphylococcus carnosus and on the Ras-binding domain (RBD) of the Ral guanine-nucleotide dissociation stimulation factor (RalGDS).
Journal of Biomolecular NMR 02/2007; 37(1):15-30. · 2.85 Impact Factor
[show abstract][hide abstract] ABSTRACT: This paper describes the developments, role and contributions of the NMR spectroscopy groups in the Structural Proteomics In Europe (SPINE) consortium. Focusing on the development of high-throughput (HTP) pipelines for NMR structure determinations of proteins, all aspects from sample preparation, data acquisition, data processing, data analysis to structure determination have been improved with respect to sensitivity, automation, speed, robustness and validation. Specific highlights are protonless (13)C-direct detection methods and inferential structure determinations (ISD). In addition to technological improvements, these methods have been applied to deliver over 60 NMR structures of proteins, among which are five that failed to crystallize. The inclusion of NMR spectroscopy in structural proteomics pipelines improves the success rate for protein structure determinations.
[show abstract][hide abstract] ABSTRACT: Rapid and accurate three-dimensional structure determination of biological macromolecules is mandatory to keep up with the vast progress made in the identification of primary sequence information. During the last few years the amount of data deposited in the protein data bank has substantially increased providing additional information for novel structure determination projects. The key question is how to combine the available database information with the experimental data of the current project ensuring that only relevant information is used and a correct structural bias is produced. For this purpose a novel fully automated algorithm based on Bayesian reasoning has been developed. It allows the combination of structural information from different sources in a consistent way to obtain high quality structures with a limited set of experimental data. The new ISIC (Intelligent Structural Information Combination) algorithm is part of the larger AUREMOL software package.
Our new approach was successfully tested on the improvement of the solution NMR structures of the Ras-binding domain of Byr2 from Schizosaccharomyces pombe, the Ras-binding domain of RalGDS from human calculated from a limited set of NMR data, and the immunoglobulin binding domain from protein G from Streptococcus by their corresponding X-ray structures. In all test cases clearly improved structures were obtained. The largest danger in using data from other sources is a possible bias towards the added structure. In the worst case instead of a refined target structure the structure from the additional source is essentially reproduced. We could clearly show that the ISIC algorithm treats these difficulties properly.
In summary, we present a novel fully automated method to combine strongly coupled knowledge from different sources. The combination with validation tools such as the calculation of NMR R-factors strengthens the impact of the method considerably since the improvement of the structures can be assessed quantitatively. The ISIC method can be applied to a large number of similar problems where the quality of the obtained three-dimensional structures is limited by the available experimental data like the improvement of large NMR structures calculated from sparse experimental data or the refinement of low resolution X-ray structures. Also structures may be refined using other available structural information such as homology models.
[show abstract][hide abstract] ABSTRACT: RELAX-JT2 is an extension of RELAX, a program for the simulation of 1H 2D NOESY spectra and (15)N or (13)C edited 3D NOESY-HSQC spectra of biological macromolecules. In addition to the already existing NOE-simulation it allows the proper simulation of line shapes by the integrated calculation of T(2) times and multiplet structures caused by J-couplings. Additionally the effects of relaxation mediated by chemical shift anisotropy are taken into account. The new routines have been implemented in the program AUREMOL, which aims at the automated NMR structure determination of proteins in solution. For a manual or automatic assignment of experimental spectra that is based on the comparison with the corresponding simulated spectra, the additional line shape information now available is a valuable aid. The new features have been successfully tested with the histidine-containing phosphocarrier protein HPr from Staphylococcus carnosus.
Journal of Biomolecular NMR 11/2004; 30(2):121-31. · 2.85 Impact Factor
[show abstract][hide abstract] ABSTRACT: Automated assignment of NOESY spectra is a prerequisite for automated structure determination of biological macromolecules. With the program KNOWNOE we present a novel, knowledge based approach to this problem. KNOWNOE is devised to work directly with the experimental spectra without interference of an expert. Besides making use of routines already implemented in AUREMOL, it contains as a central part a knowledge driven Bayesian algorithm for solving ambiguities in the NOE assignments. These ambiguities mainly arise from chemical shift degeneration which allows multiple assignments of cross peaks. Using a set of 326 protein NMR structures, statistical tables in the form of atom-pairwise volume probability distributions (VPDs) were derived. VPDs for all assignment possibilities relevant to the assignments of interproton NOEs were calculated. With these data for a given cross peak with N possible assignments Ai (i = 1,...,N) the conditional probabilities P(Ai, a/V0) can be calculated that the assignment Ai determines essentially all (a-times) of the cross peak volume V0. An assignment Ak with a probability P(Ak, a/V0) higher than 0.8 is transiently considered as unambiguously assigned. With a list of unambiguously assigned peaks a set of structures is calculated. These structures are used as input for a next cycle of iteration where a distance threshold Dmax is dynamically reduced. The program KNOWNOE was tested on NOESY spectra of a medium size protein, the cold shock protein (TmCsp) from Thermotoga maritima. The results show that a high quality structure of this protein can be obtained by automated assignment of NOESY spectra which is at least as good as the structure obtained from manual data evaluation.
Journal of Biomolecular NMR 09/2002; 23(4):271-87. · 2.85 Impact Factor