Integrating Machine Learning-Based Pose Sampling with Established Scoring Functions for Virtual Screening

18 February 2025, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Abstract Classical docking methods have dominated the field of structure-based virtual screening (VS) for decades. Recently, several machine learning (ML)-based docking approaches have been introduced, presenting a promising avenue for advancing VS technologies. In this work, we report on the integration of DiffDock-L, one of the most promising ML-based pose sampling methods, into VS workflows by combining it with the established Vina and Gnina scoring functions. We assess the integrated approach regarding its VS effectiveness, pose sampling quality, and complementarity to classical docking methods, represented by AutoDock Vina. Our results on the DUD-Z benchmark data set show that pose sampling with DiffDock-L and AutoDock Vina yields comparable performance. In contrast, the choice of the scoring function has a decisive impact on VS success. In general, DiffDock-L generates physically plausible and biologically relevant poses in most cases, confirming it as a viable alternative to classical docking algorithms.

Keywords

docking
machine learning
DiffDock
DiffDock-L
AutoDock Vina
Benchmarking

Supplementary materials

Title
Description
Actions
Title
Supporting Information
Description
Contains additional details on parameters used in executing the docking programs, statistics of the processed molecules, correlation analyses for docking scores, validity and plausibility analyses of docking poses, and statistics on protein-ligand interaction profiles of the docking poses and reference ligands for individual targets (PDF).
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.