Enumeration of de novo Inorganic Complexes for Chemical Discovery and Machine Learning

17 June 2019, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Despite being attractive targets for functional materials, the discovery of transition metal complexes with high-throughput computational screening is challenged by the amount of feasible coordination numbers, spin states, or oxidation states and the potentially large sizes of ligands. To overcome these limitations, we take inspiration from organic chemistry where full enumeration of neutral, closed shell molecules under the constraint of size has enriched discovery efforts. We design monodentate and bidentate ligands from scratch for the construction of mononuclear, octahedral transition metal complexes with up to 13 heavy atoms (i.e., metal, C, N, O, P, or S). From > 11,000 theoretical ligands, we develop a heuristic score for ranking a chemically feasible 2,500 ligand subset, only 71 of which were previously included in common organic molecule databases. We characterize the top 20% of scored ligands with density functional theory (DFT) in an octahedral homoleptic ligand database (OHLDB). The OHLDB contains i) the geometry optimized structures of 1,250 homoleptic octahedral complexes obtained from the enumerated pool of ligands and an open-shell transition metal (M(II)/M(III), M = Cr, Mn, Fe, or Co), and ii) the resulting high-spin/low-spin adiabatic electronic energies (ΔEH-L) obtained with hybrid DFT. Over the OHLDB, we observe structure–property (i.e., ΔEH-L) relationships different from those expected on the basis of ligand field arguments or from our prior data sets. Finally, we demonstrate how incorporating OHLDB data into artificial neural network (ANN) training improves ANN out-of-sample performance on much larger transition metal complexes.

Keywords

de novo enumeration
machine learning
neural networks
inorganic chemistry

Supplementary materials

Title
Description
Actions
Title
OHLDB
Description
Actions
Title
SupplementaryMaterial
Description
Actions
Title
Data
Description
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.