tx500501h_si_001.pdf (609.08 kB)
Download filePredicting Hepatotoxicity Using ToxCast in Vitro Bioactivity and Chemical Structure
journal contribution
posted on 2015-04-20, 00:00 authored by Jie Liu, Kamel Mansouri, Richard
S. Judson, Matthew T. Martin, Huixiao Hong, Minjun Chen, Xiaowei Xu, Russell
S. Thomas, Imran ShahThe U.S. Tox21 and EPA ToxCast program
screen thousands of environmental
chemicals for bioactivity using hundreds of high-throughput in vitro assays to build predictive models of toxicity.
We represented chemicals based on bioactivity and chemical structure
descriptors, then used supervised machine learning to predict in vivo hepatotoxic effects. A set of 677 chemicals was
represented by 711 in vitro bioactivity descriptors
(from ToxCast assays), 4,376 chemical structure descriptors (from
QikProp, OpenBabel, PaDEL, and PubChem), and three hepatotoxicity
categories (from animal studies). Hepatotoxicants were defined by
rat liver histopathology observed after chronic chemical testing and
grouped into hypertrophy (161), injury (101) and proliferative lesions
(99). Classifiers were built using six machine learning algorithms:
linear discriminant analysis (LDA), Naïve Bayes (NB), support
vector machines (SVM), classification and regression trees (CART),
k-nearest neighbors (KNN), and an ensemble of these classifiers (ENSMB).
Classifiers of hepatotoxicity were built using chemical structure
descriptors, ToxCast bioactivity descriptors, and hybrid descriptors.
Predictive performance was evaluated using 10-fold cross-validation
testing and in-loop, filter-based, feature subset selection. Hybrid
classifiers had the best balanced accuracy for predicting hypertrophy
(0.84 ± 0.08), injury (0.80 ± 0.09), and proliferative lesions
(0.80 ± 0.10). Though chemical and bioactivity classifiers had
a similar balanced accuracy, the former were more sensitive, and the
latter were more specific. CART, ENSMB, and SVM classifiers performed
the best, and nuclear receptor activation and mitochondrial functions
were frequently found in highly predictive classifiers of hepatotoxicity.
ToxCast and ToxRefDB provide the largest and richest publicly available
data sets for mining linkages between the in vitro bioactivity of environmental chemicals and their adverse histopathological
outcomes. Our findings demonstrate the utility of high-throughput
assays for characterizing rodent hepatotoxicants, the benefit of using
hybrid representations that integrate bioactivity and chemical structure,
and the need for objective evaluation of classification performance.
History
Usage metrics
Categories
Keywords
data setsvivo hepatotoxic effectsSVM classifiersKNNreceptor activationVitro Bioactivitymining linkagesrat liver histopathologyrodent hepatotoxicantsbioactivity classifiershistopathological outcomesLDAchemical structure descriptorsmitochondrial functionsENSMBfeature subset selectionanimal studies677 chemicalsregression treesdiscriminant analysisChemical StructureThe U.Sbioactivity descriptorsNBchemical testingchemical structureobjective evaluationhepatotoxicity categoriesclassification performanceTox 21Hybrid classifiersToxCast bioactivity descriptorsEPA ToxCast program screen thousandsPredictive performance