SNAr Regioselectivity Predictions: Machine Learning Trigger-ing DFT Reaction Modeling through Statistical Threshold

Yanfei Guan; Taegyo Lee; Ke Wang; Shu Yu; J.Christopher McWilliams

doi:10.26434/chemrxiv-2022-504v2

Organic Chemistry

Search within Organic Chemistry

SNAr Regioselectivity Predictions: Machine Learning Trigger-ing DFT Reaction Modeling through Statistical Threshold

16 December 2022, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Fast and accurate prospective predictions of the regioselectivity can significantly reduce the time and resources spent on unproductive transformations in the pharmaceutical industry. Density functional theory (DFT) reaction modeling through transition state theory (TST) and machine learning (ML) methods have been widely used to predict reaction outcomes such as selectivity. However, TST reaction modeling and ML methods are either time-consuming or data dependent. Herein, we introduce a prototype seamlessly bridging machine learning and TST modeling by triggering the resource-intensive but much less domain sensitive DFT calculation only on less confident ML predictions. The proposed workflow was trained and tested on both Pfizer internal dataset and USPTO public dataset to predict regioselectivity for SNAr reactions. Our method is accurate and fast which achieves 96.3% and 94.7% accuracy predicting the correct major product on Pfizer and USPTO datasets, respectively, in a fraction of conventional TST computing time.

Keywords

SNAr Regioselectivity

Reactivity predictions

DFT

Machine Learning

Process development

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Now Published

SNAr Regioselectivity Predictions: Machine Learning Triggering DFT Reaction Modeling through Statistical Threshold

Yanfei Guan, Taegyo Lee, Ke Wang, Shu Yu, J. Christopher McWilliams journal article

Journal of Chemical Information and Modeling , Volume 63, Issue 12

Online publication date: Jun 05, 2023

Version History

Dec 16, 2022 Version 1

Metrics

1,019

754

Views

Downloads

Citations

License

The content is available under CC BY NC 4.0

DOI

10.26434/chemrxiv-2022-504v2

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) declare that they have sought and gained approval from the relevant ethics committee/IRB for this research and its publication.

SNAr Regioselectivity Predictions: Machine Learning Trigger-ing DFT Reaction Modeling through Statistical Threshold

Authors

Abstract

Keywords

Comments

Now Published

Version History

Metrics

License

DOI

Author’s competing interest statement

Ethics

Share