Large dataset of molecular and reaction energies from multi-level quantum chemical methods

09 September 2022, Version 2
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Well curated extensive datasets have have helped spur intense molecular machine learning(ML) method development activities over last few years encouraging non-chemists to be part of the effort as well. QM9 dataset is the one of the benchmark database for small molecules with molecular energies based on B3LYP functional. G4MP2 based energies of these molecules were published later. To enable a widely variety of ML tasks with QM9 molecules, in this article, we introduce a new dataset with QM9 molecule energies estimated with 76 different DFT functionals and three different basis sets (228 energy numbers for each molecule). We additionally enumerated all possible A <-> B molecular conversions within the QM9 dataset and provide the reaction energies based on these 76 functionals, and basis sets. Lastly, we also provide the bond changes for all the 162 million reactions with the dataset to enable structure and bond based reaction energy prediction tools based on ML.

Supplementary materials

Title
Description
Actions
Title
Large dataset of molecular and reaction energies from multi-level quantum chemical methods
Description
Modified and original keys presented as a table for the DFT methods used in the database.
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.