Designing Compound Subsets: Comparison of Random and Rational Approaches Using
Statistical Simulation
Posted on 2007-11-26 - 00:00
Compound subsets, which may be screened where it is not feasible or desirable to screen all available
compounds, may be designed using rational or random selection. Literature on the relative performance of
random versus rational selection reports conflicting observations, possibly because some random subsets
might be more representative than others and perform better than subsets designed by rational means, or
vice versa. In order to address this likelihood, we simulated a large number of rationally designed subsets
for evaluation against an equally large number of randomly generated subsets. We found that our rationally
designed subsets give higher mean hit rates compared to those of the random ones. We also compared
subsets comprising random plates with subsets of random compounds and found that, while the mean hit
rate of both is the same, the former demonstrates more variation in the hit rate. The choice of compound
file, rational subset method, and ratio of the subset size to the compound file size are key factors in the
relative performance of random and rational selection, and statistical simulation is a viable way to identify
the selection approach appropriate for a subset.
CITE THIS COLLECTION
DataCiteDataCite
No result found
Yeap, Siew Kuen; Walley, Rosalind J.; Snarey, Mike; P. van Hoorn, Willem; Mason, Jonathan S. (2016). Designing Compound Subsets: Comparison of Random and Rational Approaches Using
Statistical Simulation. ACS Publications. Collection. https://doi.org/10.1021/ci600382m