Abstract
Workflows to predict chemical reaction networks based on density functional theory (DFT) are prone to systematic errors in reaction energy due to the extensive use of cheap DFT exchange-correlation functionals to limit computational cost. Recently, machine learning-based models are increasingly applied to mitigate this problem. However, machine learning models require systems similar to trained data, and the models often perform poorly for out-of-distribution systems. Here, we present a simple bond-based correction method that improves the accuracy of DFT-derived reaction energies. It is based on linear regression, and the correction terms for each bond are derived from reactions among the QM9 dataset. We demonstrate the effectiveness of this method with three DFT functionals in three different rungs of Jacob's ladder. The simple correction method is effective for all rung but especially so for the cheapest PBE functional. Finally, we applied the correction method to a few reactions with molecules significantly different from those in the QM9 dataset that was used to fit the linear regression model. Once corrected by this method, we found that the DFT reaction energies for such out-of-distribution reactions are within 0.05eV of the G4MP2 method.
Supplementary materials
Title
Cheap turns superior: A linear regression-based correction method to reaction energy from DFT
Description
The log files are provided in a zip folder which contains the log files from relavent calculations related to the application part of the main manuscript is provided. A short description about the filenames are provided in a file named INFO.txt.
Actions