Article

Chemical effects in biological systems (CEBS) object model for toxicology data, SysTox-OM: design and application.

Science Applications International Corporation, 20201 Century Boulevard, 3rd Floor, Germantown, MD 20874, USA.
Bioinformatics (Impact Factor: 5.32). 05/2006; 22(7):874-82. DOI: 10.1093/bioinformatics/btk045
Source: PubMed

ABSTRACT The CEBS data repository is being developed to promote a systems biology approach to understand the biological effects of environmental stressors. CEBS will house data from multiple gene expression platforms (transcriptomics), protein expression and protein-protein interaction (proteomics), and changes in low molecular weight metabolite levels (metabolomics) aligned by their detailed toxicological context. The system will accommodate extensive complex querying in a user-friendly manner. CEBS will store toxicological contexts including the study design details, treatment protocols, animal characteristics and conventional toxicological endpoints such as histopathology findings and clinical chemistry measures. All of these data types can be integrated in a seamless fashion to enable data query and analysis in a biologically meaningful manner.
An object model, the SysBio-OM (Xirasagar et al., 2004) has been designed to facilitate the integration of microarray gene expression, proteomics and metabolomics data in the CEBS database system. We now report SysTox-OM as an open source systems toxicology model designed to integrate toxicological context into gene expression experiments. The SysTox-OM model is comprehensive and leverages other open source efforts, namely, the Standard for Exchange of Nonclinical Data (http://www.cdisc.org/models/send/v2/index.html) which is a data standard for capturing toxicological information for animal studies and Clinical Data Interchange Standards Consortium (http://www.cdisc.org/models/sdtm/index.html) that serves as a standard for the exchange of clinical data. Such standardization increases the accuracy of data mining, interpretation and exchange. The open source SysTox-OM model, which can be implemented on various software platforms, is presented here.
A universal modeling language (UML) depiction of the entire SysTox-OM is available at http://cebs.niehs.nih.gov and the Rational Rose object model package is distributed under an open source license that permits unrestricted academic and commercial use and is available at http://cebs.niehs.nih.gov/cebsdownloads. Currently, the public toxicological data in CEBS can be queried via a web application based on the SysTox-OM at http://cebs.niehs.nih.gov
xirasagars@saic.com
Supplementary data are available at Bioinformatics online.

0 Bookmarks
 · 
77 Views
  • [Show abstract] [Hide abstract]
    ABSTRACT: PCA (principal components analysis) and ANN (artificial neural network) are two broadly used pattern recognition methods in metabolomics data-mining. Yet their limitations sometimes are great obstacles for researchers. In this paper the wavelet transform (WT) method was used to integrate with PCA and ANN to improve their performance in manipulating metabolomics data. A dataset was decomposed by wavelets and then reconstructed. The "hard thresholding" algorithm was used, through which the detail information was discarded, and the entire "metabolomics image" reconstructed on the significant information. It was supposed that the most relevant information was captured after this process. It was found that, thanks to its ability in denoising data, the WT method could significantly improve the performance of the non-linear essence-extracting method ANN in classifying samples; further integration of WT with PCA showed that WT could greatly enhance the ability of PCA in distinguishing one group of samples from another and also its ability in identifying potential biomarkers. The results highlighted WT as a promising resolution in bridging the gap between huge bytes of data and the instructive biological information.
    Metabolomics 11/2007; 3(4):531-537. · 4.43 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Numbers of microarrays have been examined and several public and commercial databases have been developed. However, it is not easy to compare in-house microarray data with those in a database because of insufficient reproducibility due to differences in the experimental conditions. As one of the approach to use these databases, we developed the similar compounds searching system (SCSS) on a toxicogenomics database. The datasets of 55 compounds administered to rats in the Toxicogenomics Project (TGP) database in Japan were used in this study. Using the fold-change ranking method developed by Lamb et al. [Lamb, J., Crawford, E.D., Peck, D., Modell, J.W., Blat, I.C., Wrobel, M.J., Lerner, J., Brunet, J.P., Subramanian, A., Ross, K.N., Reich, M., Hieronymus, H., Wei, G., Armstrong, S.A., Haggarty, S.J., Clemons, P.A., Wei, R., Carr, S.A., Lander, E.S., Golub, T.R., 2006. The connectivity map: using gene-expression signatures to connect small molecules, genes, and disease. Science 313, 1929-1935] and criteria called hit ratio, the system let us compare in-house microarray data and those in the database. In-house generated data for clofibrate, phenobarbital, and a proprietary compound were tested to evaluate the performance of the SCSS method. Phenobarbital and clofibrate, which were included in the TGP database, scored highest by the SCSS method. Other high scoring compounds had effects similar to either phenobarbital (a cytochrome P450s inducer) or clofibrate (a peroxisome proliferator). Some of high scoring compounds identified using the proprietary compound-administered rats have been known to cause similar toxicological changes in different species. Our results suggest that the SCSS method could be used in drug discovery and development. Moreover, this method may be a powerful tool to understand the mechanisms by which biological systems respond to various chemical compounds and may also predict adverse effects of new compounds.
    Toxicology Letters 09/2008; 186(1):52-7. · 3.15 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Adverse drug reactions continue to be a major cause of morbidity in both patients receiving therapeutics and in drug R&D programs. Predicting and possibly eliminating these adverse events remains a high priority in industry, government agencies and healthcare systems. With small molecule candidates, the fusion of nonclinical and clinical data is essential in establishing an overall system that creates a true translational science approach. Several new advances are taking place that attempt to create a 'patient context' mechanism early in drug research and development and ultimately into the marketplace. This 'life-cycle' approach has as its core the development of human-oriented, nonclinical end points and the incorporation of clinical knowledge at the drug design stage. The next 5 years should witness an explosion of what the author views as druggable and safe chemical space, pharmacosafety molecular targets and the most important aspect, an understanding of unique susceptibilities in patients developing adverse drug reactions. Our current knowledge of clinical safety relies completely on pharmacovigilance data from approved and marketed drugs, with a few exceptions of drugs failing in clinical trials. Massive data repositories now and soon to be available via cloud computing should stimulate a major effort in expanding our view of clinical drug safety and its incorporation into early drug research and development.
    Expert Review of Clinical Pharmacology 03/2013; 6(2):185-95.

Full-text (2 Sources)

View
18 Downloads
Available from
May 30, 2014