Leveraging High-Spin DFT Features for Prediction of Spin State Gaps in 3d Transition Metal Complexes

26 March 2025, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Determining spin state gap energetics (SSE) in 3d transition metal complexes (TMCs) is a major challenge in theoretical chemistry, as high-level quantum methods, though reliable, are time-intensive for large-scale studies. This work explores a machine learning (ML)-based approach to predict DFT adiabatic SSE gaps using descriptors derived from a single high-spin DFT calculation. This approach is adopted to eliminate the differential treatment of electronic correlation between high spin (HS) and low spin (LS) structures. Our descriptors try to inculcate the knowledge of crystal field theory into the ML model. It includes atomic energy levels of bare metal ions, natural charges and d-orbital molecular orbital eigenvalues derived from an HS calculation, ligand HOMO-LUMO gaps, and simple identity-based features. We train ML models on 1434 SSE values spanning 934 complexes and demonstrate their transferability to more complex bidentate π-bonding ligands despite being trained on simpler Werner-type monodentate complexes. This approach bypasses the need for multi-reference low-spin (LS) optimizations while retaining predictive accuracy, offering a cost-effective strategy for SSE estimation in transition metal chemistry. We hope the insights covered in this study will contribute to the development of further electronic structure-based descriptors for SSE predictions.

Keywords

Spin state Gaps
ML
DFT
Transition metal complex
SSE prediction

Supplementary materials

Title
Description
Actions
Title
Supplementary Information for the main manuscript
Description
The document contains an elaborate description of the dataset, feature vector selection, ML predictions across different models and results obtained for transferability check of the model. The sections, figures and tables of this text are referred to in the main manuscript wherever detailed discussion is required.
Actions
Title
Dataset and Results
Description
This zipped folder contains all the training and test datasets and results. The attached Python code can generate the feature vectors from the G16 output file. The README file provides a short description of each file.
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.