posted on 2023-11-21, 18:24authored byJackson
W. Burns, David M. Rogers
Quantitative structure–odor relationships are
critically
important for studies related to the function of olfaction. Current
literature data sets contain expert-labeled molecules but lack feature
data. This paper introduces QuantumScents, a quantum mechanics augmented
derivative of the Leffingwell data set. QuantumScents contains 3.5k
structurally and chemically diverse molecules ranging from 2 to 30
heavy atoms (CNOS) and their corresponding 3D coordinates, total PBE0
energy, molecular dipole moment, and per-atom Hirshfeld charges, dipoles,
and ratios. The authors demonstrate that Hirshfeld charges and ratios
contain sufficient information to perform molecular classification
by training a Message Passing Neural Network with chemprop (Heid, E.; et al. ChemRxiv, 2023,
DOI: 10.26434/chemrxiv-2023-3zcfl) to predict scent labels.
The QuantumScents data set is freely available on Zenodo along with
the authors’ code, example models, and data set generation
workflow (https://zenodo.org/doi/10.5281/zenodo.8239853).