Chemical effects in biological systems (CEBS) object model for toxicology data, SysTox-OM: design and application.

Science Applications International Corporation, 20201 Century Boulevard, 3rd Floor, Germantown, MD 20874, USA.
Bioinformatics (Impact Factor: 5.32). 05/2006; 22(7):874-82. DOI: 10.1093/bioinformatics/btk045
Source: PubMed

ABSTRACT The CEBS data repository is being developed to promote a systems biology approach to understand the biological effects of environmental stressors. CEBS will house data from multiple gene expression platforms (transcriptomics), protein expression and protein-protein interaction (proteomics), and changes in low molecular weight metabolite levels (metabolomics) aligned by their detailed toxicological context. The system will accommodate extensive complex querying in a user-friendly manner. CEBS will store toxicological contexts including the study design details, treatment protocols, animal characteristics and conventional toxicological endpoints such as histopathology findings and clinical chemistry measures. All of these data types can be integrated in a seamless fashion to enable data query and analysis in a biologically meaningful manner.
An object model, the SysBio-OM (Xirasagar et al., 2004), has been designed to facilitate the integration of microarray gene expression, proteomics and metabolomics data in the CEBS database system. We now report SysTox-OM, an open source systems toxicology model designed to integrate toxicological context into gene expression experiments. The SysTox-OM model is comprehensive and leverages other open source efforts, namely the Standard for Exchange of Nonclinical Data, a data standard for capturing toxicological information from animal studies, and the Clinical Data Interchange Standards Consortium, which provides a standard for the exchange of clinical data. Such standardization increases the accuracy of data mining, interpretation and exchange. The open source SysTox-OM model, which can be implemented on various software platforms, is presented here.
A unified modeling language (UML) depiction of the entire SysTox-OM is available online, and the Rational Rose object model package is distributed under an open source license that permits unrestricted academic and commercial use. Currently, the public toxicological data in CEBS can be queried via a web application based on the SysTox-OM.
Supplementary data are available at Bioinformatics online.
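
The abstract describes toxicological context (study design, treatment protocol, animal characteristics, conventional endpoints) stored alongside omics measurements so that both can be queried together. As a rough illustration only, that idea might be sketched as follows. Every class, field and value here is hypothetical, chosen for the example; none of it is taken from the SysTox-OM UML, SEND or CDISC.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Animal:
    # Hypothetical animal characteristics
    animal_id: str
    species: str
    sex: str


@dataclass
class Treatment:
    # Hypothetical treatment protocol for one animal
    stressor: str
    dose_mg_per_kg: float
    duration_hours: float


@dataclass
class Observation:
    # One measurement for one animal: a conventional endpoint or an omics value
    animal_id: str
    domain: str  # e.g. "clinical_chemistry", "histopathology", "transcriptomics"
    name: str
    value: float


@dataclass
class Study:
    # Hypothetical container tying observations to their toxicological context
    study_id: str
    design: str
    animals: Dict[str, Animal] = field(default_factory=dict)
    treatments: Dict[str, Treatment] = field(default_factory=dict)  # keyed by animal_id
    observations: List[Observation] = field(default_factory=list)

    def add_animal(self, animal: Animal, treatment: Treatment) -> None:
        self.animals[animal.animal_id] = animal
        self.treatments[animal.animal_id] = treatment

    def add_observation(self, obs: Observation) -> None:
        if obs.animal_id not in self.animals:
            raise KeyError(f"unknown animal: {obs.animal_id}")
        self.observations.append(obs)

    def query(self, domain: str, min_dose: float = 0.0) -> List[Observation]:
        # Select observations of one data type, filtered by treatment context
        return [
            o for o in self.observations
            if o.domain == domain
            and self.treatments[o.animal_id].dose_mg_per_kg >= min_dose
        ]


# Toy values, made up purely to exercise the sketch
study = Study(study_id="S1", design="dose-response")
study.add_animal(Animal("A1", "rat", "M"), Treatment("compound X", 0.0, 24.0))
study.add_animal(Animal("A2", "rat", "M"), Treatment("compound X", 50.0, 24.0))
study.add_observation(Observation("A1", "clinical_chemistry", "ALT", 40.0))
study.add_observation(Observation("A2", "clinical_chemistry", "ALT", 120.0))
study.add_observation(Observation("A2", "transcriptomics", "gene_1", 2.5))

high_dose_chem = study.query("clinical_chemistry", min_dose=10.0)
```

Keeping the treatment and animal records on the same study object as the measurements is what lets a single query filter expression values by dose, which is the kind of context-aligned retrieval the abstract describes.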

  • ABSTRACT: PCA (principal components analysis) and ANN (artificial neural network) are two widely used pattern recognition methods in metabolomics data mining, yet their limitations can be serious obstacles for researchers. In this paper, the wavelet transform (WT) method was integrated with PCA and ANN to improve their performance in handling metabolomics data. A dataset was decomposed by wavelets and then reconstructed. The "hard thresholding" algorithm was used, through which the detail information was discarded and the entire "metabolomics image" was reconstructed from the significant information. It was assumed that the most relevant information was captured by this process. Thanks to its ability to denoise data, the WT method significantly improved the performance of the non-linear essence-extracting method ANN in classifying samples. Further integration of WT with PCA showed that WT greatly enhanced the ability of PCA to distinguish one group of samples from another and to identify potential biomarkers. The results highlight WT as a promising approach to bridging the gap between large volumes of data and instructive biological information.
    Metabolomics 11/2007; 3(4):531-537. · 4.43 Impact Factor
  • ABSTRACT: Adverse drug reactions continue to be a major cause of morbidity both in patients receiving therapeutics and in drug R&D programs. Predicting and possibly eliminating these adverse events remains a high priority in industry, government agencies and healthcare systems. With small molecule candidates, the fusion of nonclinical and clinical data is essential in establishing an overall system that creates a true translational science approach. Several new advances are taking place that attempt to create a 'patient context' mechanism early in drug research and development and ultimately into the marketplace. This 'life-cycle' approach has as its core the development of human-oriented, nonclinical end points and the incorporation of clinical knowledge at the drug design stage. The next 5 years should witness an explosion of what the author views as druggable and safe chemical space, pharmacosafety molecular targets and, most importantly, an understanding of unique susceptibilities in patients developing adverse drug reactions. Our current knowledge of clinical safety relies completely on pharmacovigilance data from approved and marketed drugs, with a few exceptions of drugs failing in clinical trials. Massive data repositories, available now or soon via cloud computing, should stimulate a major effort in expanding our view of clinical drug safety and its incorporation into early drug research and development.
    Expert Review of Clinical Pharmacology 03/2013; 6(2):185-95.
  • ABSTRACT: Many drug or single nucleotide polymorphism (SNP)-related resources and tools have been developed, but connecting and integrating them is still a challenge. Here, we describe a user-friendly web-based software package, named Drug-SNPing, which provides a platform for the integration of drug information (DrugBank and PharmGKB), protein-protein interactions (STRING), tagSNP selection (HapMap) and genotyping information (dbSNP, REBASE and SNP500Cancer). DrugBank-based inputs include the following: (i) common name of the drug, (ii) synonym or drug brand name, (iii) gene name (HUGO) and (iv) keywords. PharmGKB-based inputs include the following: (i) gene name (HUGO), (ii) drug name and (iii) disease-related keywords. The output provides drug-related information, metabolizing enzymes and drug targets, as well as protein-protein interaction data. Importantly, tagSNPs of the selected genes are retrieved for genotyping analyses. All drug-based and protein-protein interaction-based SNP genotyping information is provided with PCR-RFLP (PCR-restriction fragment length polymorphism) and TaqMan probes. Thus, users can enter any drug keywords or brand names to obtain immediate information that is highly relevant to genotyping for pharmacogenomics research. Availability and implementation: Drug-SNPing and its user manual are freely available online.
    Bioinformatics 02/2013; · 5.47 Impact Factor
