Integrating Machine Learning-Based Pose Sampling with Established Scoring Functions for Virtual Screening

Thi Ngoc Lan Vu; Hosein Fooladi; Johannes Kirchmair

doi:10.26434/chemrxiv-2025-96kzg

Theoretical and Computational Chemistry

18 February 2025, Version 1

This is not the most recent version. There is a newer version of this content available.

Working Paper

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Classical docking methods have dominated the field of structure-based virtual screening (VS) for decades. Recently, several machine learning (ML)-based docking approaches have been introduced, presenting a promising avenue for advancing VS technologies. In this work, we report on the integration of DiffDock-L, one of the most promising ML-based pose sampling methods, into VS workflows by combining it with the established Vina and Gnina scoring functions. We assess the integrated approach with regard to its VS effectiveness, pose sampling quality, and complementarity to classical docking methods, represented by AutoDock Vina. Our results on the DUD-Z benchmark data set show that pose sampling with DiffDock-L and with AutoDock Vina yields comparable performance. In contrast, the choice of scoring function has a decisive impact on VS success. DiffDock-L generates physically plausible and biologically relevant poses in most cases, confirming it as a viable alternative to classical docking algorithms.

Supplementary materials

Supporting Information

Contains additional details on parameters used in executing the docking programs, statistics of the processed molecules, correlation analyses for docking scores, validity and plausibility analyses of docking poses, and statistics on protein-ligand interaction profiles of the docking poses and reference ligands for individual targets (PDF).

Version History

Feb 24, 2025 Version 2

Feb 18, 2025 Version 1

Metrics

Views: 501

Downloads: 294

License

The content is available under CC BY 4.0

Funding

Austrian Federal Ministry of Labour and Economy

Austrian National Foundation for Research, Technology and Development

Christian Doppler Research Association

Boehringer-Ingelheim RCV GmbH & Co KG

BASF SE

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content
