Generality-driven optimization of enantio- and regioselective catalysis by high-throughput experimentation and machine learning

23 April 2025, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

The longstanding quest for achieving substrate generality stems from the unpredictability of single-model optimization. Leveraging the potential of high-throughput experimentation (HTE), we present a practical multi-substrate screening strategy that enables the general asymmetric mono-reduction of unsymmetrical 1,2-dicarbonyl compounds. This study combines quantitative 1H NMR with simultaneous chiral analysis facilitated by 19F NMR for pooled samples without any purification steps. Remarkably, this approach accelerated the workflow by eight times compared to the traditional method, enabling robust screening of 31 chiral oxazaborolidinium ion (COBI) variants across eight 1,2-dicarbonyl compounds. Extensive substrate screening successfully addressed even the most challenging substrates with subtle steric differentiation, such as Et/Me. Moreover, the HTE data were utilized in a machine learning (ML) model with descriptors based on the CGR (Condensed Graphs of Reaction), which identified catalysts with enhanced reactivity for previously unexplored substrates, independent of quantum chemical calculations. In this approach, the ARMS (Automated Reaction Mapping for various Substituents) system was introduced to streamline SMILES (Simplified Molecular Input Line Entry System) preprocessing for multi-substrate reactions, improving data efficiency and enabling CGR generation. The resulting chiral α-silyloxy ketones, obtained in excellent yields (up to >99%) with excellent selectivities (up to >99% ee, up to >20:1 r.r.), were then easily transformed to form various natural products and pharmaceuticals, such as (S)-bupropion—originally sold as a racemic mixture but now available in its chiral form.

Keywords

high-throughput experimentation
machine learning
organocatalysis
enantioselectivity
regioselectivity

Supplementary materials

Title
Description
Actions
Title
Generality-driven optimization of enantio- and regioselective catalysis by high-throughput experimentation and machine learning
Description
Infrared spectra were recorded on a Bruker Vertex 70. HRMS were recorded a Supercritical Fluid Chromatograph combined with Xevo G2-XS QTOF Mass Spectrometer (Waters, Milford, MA, USA). Analytical high performance liquid chromatography (HPLC) was performed on YL 9100 HPLC systemusing the indicated chiral column (25 cm). Optical rotations were determined on a Jasco P-1020 at 589 nm. Melting point was identified with Buchi Melting Point M-560. Analytical data were recorded using NMR spectrometer (1H and 13C), Infrared spectrometer, HPLC, polarimeter, and melting point apparatus at the Chiral Material Core Facility Center of Sungkyunkwan University. 19F{1H} NMR spectra were recorded with equipment from the Department of Chemistry of Korea Advanced Institute of Science and Technology. Single-crystal diffraction were collected with a Bruker D8 VENTURE diffractometer at Gyeongsang National University Center for Research Facilities.
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.