Abstract
We report a multivariate linear regression model able to make accurate predictions for the rate and regioselectivity of nucleophilic aromatic substitution (SNAr) reactions based on the electrophile structure. This model uses a diverse training/test set from experimentally-determined relative SNAr rates between benzyl alcohol and 74 unique electrophiles, including heterocycles with multiple substitution patterns. There is a robust linear relationship between the experimental SNAr free energies of activation and three molecular descriptors that can be obtained computationally: the LUMO energy of the electrophile; the average molecular electrostatic potential (ESP) at the carbon undergoing substitution; and the sum of average ESP values for the ortho and para atoms relative to the reactive center. Despite using only simple descriptors calculated from ground state wavefunctions, this model demonstrates excellent correlation with previously measured SNAr reaction rates, and is able to accurately predict site selectivity for multihalogenated substrates: 91% prediction accuracy across 82 individual examples. The excellent agreement between predicted and experimental outcomes makes this easy-to-implement reactivity model a potentially powerful tool for synthetic planning.
Supplementary materials
Title
Supplementary Information
Description
Experimental and computational details, full tables of descriptor values and experimental results, spectroscopic data, and model validation studies.
Actions
Title
Supplementary Data Tables
Description
All molecular descriptor and experimental data used to generate multivariate models and perform validations. Microsoft Excel format.
Actions
Title
Supplementary Computational Files
Description
Zip file containing all coordinate files (xyz format) for computed structures and transition states.
Actions