IMGT/LIGM-DB: a systematized approach for ImMunoGeneTics database coherence and data distribution improvement.
ABSTRACT IMGT, the international ImMunoGeneTics database (http:(/)/imgt.cnusc.fr:8104), created by Marie-Paule Lefranc, Montpellier, France, is an integrated database specializing in antigen receptors and MHC of all vertebrate species. IMGT includes LIGM-DB, developed for Immunoglobulins and T-cell-receptors. LIGM-DB distributes high quality data with an important increment value added by the LIGM expert annotations. LIGM-DB accurate immunogenetics data is based on the standardization of biological knowledge related to keywords, annotation labels and gene identification. The management of such data resulting from biological research requires an high flexible implementation to quickly reflect up-to-date results, and to integrate new knowledge. We developed a systematized approach and defined LIGM-DB systems which manage and realize the major tasks for the database survey. In this paper, we will focus on the coherence system, which became absolutely crucial to maintain data quality as the database is growing up and as the biological knowledge continues to improve, and on the distribution system which makes LIGM-DB data easy to access, download and reuse. Efforts have been done to improve the data distribution procedures and adapt them to the current bioinformatics needs. Recently, we have developed an API which allows Java programmers to remotely access and integrate LIGM-DB data in other computer environments.
- SourceAvailable from: Marie-Paule Lefranc[Show abstract] [Hide abstract]
ABSTRACT: MOTIVATION: IMGT, the international ImMunoGeneTics database (http:@imgt.cines.fr:8104), created by M.-P. Lefranc, is an integrated database specializing in antigen receptors (immunoglobulins and T-cell receptors) and major histocompatibility complex (MHC) of all vertebrate species. IMGT accurate immunogenetics data are based on the standardization of the biological knowledge provided by the 'ImMunoGeneTics' IMGT-ONTOLOGY. The IMGT-ONTOLOGY describes the classification and specification of terms needed for immunogenetics and bioinformatics. IMGT-ONTOLOGY covers four main concepts: 'IDENTIFICATION', 'DESCRIPTION', 'CLASSIFICATION' and 'OBTENTION'. These concepts allow an extensive and standardized description and characterization of immunoglobulin and T-cell receptor data. The controlled vocabulary and the annotation rules are indispensable to ensure accuracy, consistency and coherence in IMGT. IMGT-ONTOLOGY allows scientists and clinicians to use, for the first time, identical terms with the same meaning in immunogenetics. It provides a semantic repository that will improve interoperability between specialist and generalist databases.Bioinformatics 01/2000; 15(12):1047-54. · 5.32 Impact Factor
- [Show abstract] [Hide abstract]
ABSTRACT: Immunoglobulin (IG) complementarity determining region (CDR) includes VH CDR1, VH CDR2, VH CDR3, VL CDR1, VL CDR2 and VL CDR3. Of these, VH CDR3 plays a dominant role in recognizing and binding antigens. Three major mechanisms are involved in the formation of the VH repertoire: germline gene rearrangement, junctional diversity and somatic hypermutation. Features of the generation mechanisms of VH repertoire in humans and mice share similarities while VH CDR3 amino acid (AA) composition differs. Previous studies have mainly focused on germline gene rearrangement and the composition and structure of the CDR3 AA in humans and mice. However the number of AA changes due to somatic hypermutation and analysis of the junctional mechanism have been ignored.Theoretical Biology and Medical Modelling 07/2014; 11(1):30. · 1.46 Impact Factor