posted on 2005-07-25, 00:00authored byC. W. Yap, Y. Z. Chen
Statistical learning methods have been used in developing filters for predicting inhibitors of two P450
isoenzymes, CYP3A4 and CYP2D6. This work explores the use of different statistical learning methods for
predicting inhibitors of these enzymes and an additional P450 enzyme, CYP2C9, and the substrates of the
three P450 isoenzymes. Two consensus support vector machine (CSVM) methods, “positive majority” (PM-CSVM) and “positive probability” (PP-CSVM), were used in this work. These methods were first tested for
the prediction of inhibitors of CYP3A4 and CYP2D6 by using a significantly higher number of inhibitors
and noninhibitors than that used in earlier studies. They were then applied to the prediction of inhibitors of
CYP2C9 and substrates of the three enzymes. Both methods predict inhibitors of CYP3A4 and CYP2D6 at
a similar level of accuracy as those of earlier studies. For classification of inhibitors of CYP2C9, the best
CSVM method gives an accuracy of 88.9% for inhibitors and 96.3% for noninhibitors. The accuracies for
classification of substrates and nonsubstrates of CYP3A4, CYP2D6, and CYP2C9 are 98.2 and 90.9%, 96.6
and 94.4%, and 85.7 and 98.8%, respectively. Both CSVM methods are potentially useful as filters for
predicting inhibitors and substrates of P450 isoenzymes. These methods generally give better accuracies
than single SVM classification systems, and the performance of the PP-CSVM method is slightly better
than that of the PM-CSVM method.