Abstract
In silico identification of potent protein inhibitors commonly requires prediction of a ligand binding free energy (BFE). Thermodynamics integration (TI) based on molecular dynamics (MD) simulations is a BFE calculation method capable of predicting accurate BFE, but it is computationally expensive and time-consuming. In this work, we developed an efficient automated workflow for identifying compounds with the lowest BFE among thousands of congeneric ligands which requires only hundreds of TI calculations. Automated Machine Learning (AutoML) orchestrated by Active Learning (AL) in AL-AutoML workflow allows unbiased and efficient search for a small set of best performing molecules. We applied this workflow to select inhibitors of the SARS-CoV-2 papain-like protease. Our work resulted in predicting 133 compounds with improved binding affinity among which 16 compounds with better than 100-fold binding affinity improvement. The hit rate obtained here is better than that of traditional projects where molecule selection is guided by an expert medicinal chemist. We demonstrated that a combination of an AL protocol provides at least 20x the common brute force approaches.
Supplementary materials
Title
Supplementary Information
Description
Supplementary text, Figures S1 to S, Tables S1 to S2, SI References
Actions