[Show abstract][Hide abstract] ABSTRACT: The European REACH regulation requires information on ready biodegradation, which is a screening test to assess the biodegradability of chemicals. At the same time REACH encourages the use of alternatives to animal testing which includes predictions from QSAR models. The aim of this study was to build QSAR models to predict ready biodegradation of chemicals by using different modelling methods and types of molecular descriptors. Particular attention was given to data screening and validation procedures in order to build predictive models. Experimental values of 1055 chemicals were collected from the webpage of the National Institute of Technology and Evaluation of Japan (NITE): 837 and 218 molecules were used for calibration and testing purposes, respectively. In addition, models were further evaluated using an external validation set consisting of 670 molecules. Classification models were produced in order to discriminate biodegradable and non-biodegradable chemicals by means of different mathematical methods: k Nearest Neighbours, Partial Least Squares Discriminant Analysis and Support Vector Machines, as well as their consensus models. The proposed models and the derived consensus analysis demonstrated good classification performances with respect to already published QSAR models on biodegradation. Relationships between the molecular descriptors selected in each QSAR model and biodegradability were evaluated.
Journal of Chemical Information and Modeling 03/2013; 53:867-878. · 4.30 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: This paper presents four new QSAR models for CYP2C9 and CYP2D6 substrate recognition and inhibitor identification based on human clinical data. The models were used to screen a large data set of environmental chemicals for CYP activity, and to analyze the frequency of CYP activity among these compounds. A large fraction of these chemicals were found to be CYP active, and thus potentially capable of affecting human physiology. 20% of the compounds within applicability domain of the models were predicted to be CYP2C9 substrates, and 17% to be inhibitors. The corresponding numbers for CYP2D6 were 9% and 21%. Where the majority of CYP2C9 active compounds were predicted to be both a substrate and an inhibitor at the same time, the CYP2D6 active compounds were primarily predicted to be only inhibitors. It was demonstrated that the models could identify compound classes with a high occurrence of specific CYP activity. An overrepresentation was seen for poly-aromatic hydrocarbons (group of procarcinogens) among CYP2C9 active and mutagenic compounds compared to CYP2C9 inactive and mutagenic compounds. The mutagenicity was predicted with a QSAR model based on Ames in vitro test data.
[Show abstract][Hide abstract] ABSTRACT: Three modelling systems (MultiCase®, LeadScope® and MDL® QSAR) were used for construction of androgenic receptor antagonist models. There were 923-942 chemicals in the training sets. The models were cross-validated (leave-groups-out) with concordances of 77-81%, specificity of 78-91% and sensitivity of 51-76%. The specificity was highest in the MultiCase® model and the sensitivity was highest in the MDL® QSAR model. A complementary use of the models may be a valuable tool when optimizing the prediction of chemicals for androgenic receptor antagonism. When evaluating the fitness of the model for a particular application, balance of training sets, domain definition, and cut-offs for prediction interpretation should also be taken into account. Different descriptors in the modelling systems are illustrated with hydroxyflutamide and dexamethasone as examples (a non-steroid and a steroid anti-androgen, respectively). More research concerning the mechanism of anti-androgens would increase the possibility for further optimization of the QSAR models. Further expansion of the basis for the models is in progress, including the addition of more drugs.
SAR and QSAR in environmental research 03/2011; 22(1-2):35-49. · 1.68 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Human Cytochrome P450 (CYP) is a large group of enzymes that possess an essential function in metabolising different exogenous and endogenous compounds. Humans have more than 50 different genes encoding CYP enzymes, among these a gene encoding for the CYP isoenzyme 2D6, a CYP able to metabolise drugs and other chemicals. A training set of 747 chemicals primarily based on in vivo human data for the CYP isoenzyme 2D6 was collected from the literature. QSAR models focusing on substrate/non-substrate activity were constructed by the use of MultiCASE, Leadscope and MDL quantitative structure-activity relationship (QSAR) modelling systems. They cross validated (leave-groups-out) with concordances of 71%, 81% and 82%, respectively. Discrete organic European Inventory of Existing Commercial Chemical Substances (EINECS) chemicals were screened to predict an approximate percentage of CYP 2D6 substrates. These chemicals are potentially present in the environment. The biological importance of the CYP 2D6 and the use of the software mentioned above were discussed.
SAR and QSAR in environmental research 02/2009; 20(3-4):309-25. · 1.68 Impact Factor