IMGT/LIGM-DB: a systematized approach for ImMunoGeneTics database coherence and data distribution improvement.
ABSTRACT IMGT, the international ImMunoGeneTics database (http:(/)/imgt.cnusc.fr:8104), created by Marie-Paule Lefranc, Montpellier, France, is an integrated database specializing in antigen receptors and MHC of all vertebrate species. IMGT includes LIGM-DB, developed for Immunoglobulins and T-cell-receptors. LIGM-DB distributes high quality data with an important increment value added by the LIGM expert annotations. LIGM-DB accurate immunogenetics data is based on the standardization of biological knowledge related to keywords, annotation labels and gene identification. The management of such data resulting from biological research requires an high flexible implementation to quickly reflect up-to-date results, and to integrate new knowledge. We developed a systematized approach and defined LIGM-DB systems which manage and realize the major tasks for the database survey. In this paper, we will focus on the coherence system, which became absolutely crucial to maintain data quality as the database is growing up and as the biological knowledge continues to improve, and on the distribution system which makes LIGM-DB data easy to access, download and reuse. Efforts have been done to improve the data distribution procedures and adapt them to the current bioinformatics needs. Recently, we have developed an API which allows Java programmers to remotely access and integrate LIGM-DB data in other computer environments.
[Show abstract] [Hide abstract]
ABSTRACT: Background Immunoglobulin (IG) complementarity determining region (CDR) includes VH CDR1, VH CDR2, VH CDR3, VL CDR1, VL CDR2 and VL CDR3. Of these, VH CDR3 plays a dominant role in recognizing and binding antigens. Three major mechanisms are involved in the formation of the VH repertoire: germline gene rearrangement, junctional diversity and somatic hypermutation. Features of the generation mechanisms of VH repertoire in humans and mice share similarities while VH CDR3 amino acid (AA) composition differs. Previous studies have mainly focused on germline gene rearrangement and the composition and structure of the CDR3 AA in humans and mice. However the number of AA changes due to somatic hypermutation and analysis of the junctional mechanism have been ignored. Methods Here we analyzed 9,340 human and 6,657 murine unique productive sequences of immunoglobulin (IG) variable heavy (VH) domains derived from IMGT/LIGM-DB database to understand how VH CDR3 AA compositions significantly differed between human and mouse. These sequences were identified and analyzed by IMGT/HighV-QUEST (http://www.imgt.org), including gene usage, number of AA changes due to somatic hypermutation, AA length distribution of VH CDR3, AA composition, and junctional diversity. Results Analyses of human and murine IG repertoires showed significant differences. A higher number of AA changes due to somatic hypermutation and more abundant N-region addition were found in human compared to mouse, which might be an important factor leading to differences in VH CDR3 amino acid composition. Conclusions These findings are a benchmark for understanding VH repertoires and can be used to characterize the VH repertoire during immune responses. The study will allow standardized comparison for high throughput results obtained by IMGT/HighV-QUEST, the reference portal for NGS repertoire.Theoretical Biology and Medical Modelling 07/2014; 11(1):30. DOI:10.1186/1742-4682-11-30 · 1.27 Impact Factor
Computers & Chemistry 02/2000; 24(1):129-32. DOI:10.1016/S0097-8485(00)80012-1