From Data to Chemistry: Revealing Causality and Reaction Coordinates through Interpretable Machine Learning in Supramolecular Transition Metal Catalysis

25 June 2024, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Supramolecular transition metal catalysts with tailored reaction environments allow for the usage of abundant 3d metals as catalytic centres, leading to more sustainable chemical processes. However, such catalysts are large and flexible systems with intricate interactions, resulting in complex reaction coordinates. To capture their dynamic nature, we developed a broadly applicable, high-throughput workflow, leveraging quantum mechanics/molecular mechanics (QM/MM) molecular dynamics in explicit solvent, to investigate a Cu(I)-calix[8]arene catalysed C-N coupling reaction. The system complexity and high amount of data generated from sampling the reaction require automated analyses. To identify and quantify the reaction coordinate from noisy simulation trajectories, we applied interpretable machine learning techniques (Lasso, Random Forest, Logistic Regression) in a consensus model, alongside dimensionality reduction methods (PCA, LDA, tICA). Leveraging a Granger Causality model, we go beyond the traditional view of a reaction coordinate, by defining it as a sequence of molecular motions that led up to the reaction.

Keywords

QM/MM MD
Interpretable Machine Learning
Computational Catalysis
Granger Causality

Supplementary materials

Title
Description
Actions
Title
Supporting Information
Description
Details concerning the computational modelling and the data analyses.
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.