posted on 2011-08-22, 00:00authored byRadka Svobodová Vařeková, Stanislav Geidl, Crina-Maria Ionescu, Ondřej Skřehota, Michal Kudera, David Sehnal, Tomáš Bouchal, Ruben Abagyan, Heinrich J. Huber, Jaroslav Koča
The acid dissociation (ionization) constant pKa is one of the fundamental properties of organic molecules. We have evaluated different computational strategies and models to predict the pKa values of substituted phenols using partial atomic charges. Partial atomic charges for 124 phenol molecules were calculated using 83 approaches containing seven theory levels (MP2, HF, B3LYP, BLYP, BP86, AM1, and PM3), three basis sets (6-31G*, 6-311G, STO-3G), and five population analyses (MPA, NPA, Hirshfeld, MK, and Löwdin). The correlations between pKa and various atomic charge descriptors were examined, and the best descriptors were selected for preparing the quantitative structure–property relationship (QSPR) models. One QSPR model was created for each of the 83 approaches to charge calculation, and then the accuracy of all these models was analyzed and compared. The pKas predicted by most of the models correlate strongly with experimental pKa values. For example, more than 25% of the models have correlation coefficients (R2) greater than 0.95 and root-mean-square errors smaller than 0.49. All seven examined theory levels are applicable for pKa prediction from charges. The best results were obtained for the MP2 and HF level of theory. The most suitable basis set was found to be 6-31G*. The 6-311G basis set provided slightly weaker correlations, and unexpectedly also, the STO-3G basis set is applicable for the QSPR modeling of pKa. The Mulliken, natural, and Löwdin population analyses provide accurate models for all tested theory levels and basis sets. The results provided by the Hirshfeld population analysis were also acceptable, but the QSPR models based on MK charges show only weak correlations.