Learning from Docked Ligands: Ligand-Based Features Rescue Structure-Based Scoring Functions When Trained On Docked Poses

Fergus Boyles; Charlotte M Deane; Garrett Morris

doi:10.26434/chemrxiv.13637756.v1

Theoretical and Computational Chemistry

27 January 2021, Version 1

Working Paper

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Machine learning scoring functions for protein-ligand binding affinity have been found to consistently outperform classical scoring functions when trained and tested on crystal structures of bound protein-ligand complexes. However, it is less clear how these methods perform when applied to docked poses of complexes.

We explore how the use of docked, rather than crystallographic, poses for both training and testing affects the performance of machine learning scoring functions. Using the PDBbind Core Sets as benchmarks, we show that the performance of a structure-based machine learning scoring function trained and tested on docked poses is lower than that of the same scoring function trained and tested on crystallographic poses. We construct a hybrid scoring function by combining both structure-based and ligand-based features, and show that its ability to predict binding affinity using docked poses is comparable to that of purely structure-based scoring functions trained and tested on crystal poses. Despite strong performance on docked poses of the PDBbind Core Sets, we find that our hybrid scoring function fails to generalise to a new data set, demonstrating the need for improved scoring functions and additional validation benchmarks.
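As a rough illustration of the hybrid idea described above, the following minimal Python sketch concatenates a structure-based feature vector with a ligand-based one. It is not the authors' implementation: the function name `hybrid_features`, the feature examples, and all numeric values are hypothetical, and the sketch assumes that ligand-based features are computed from the ligand alone and therefore do not change between a crystal pose and a docked pose.

```python
from typing import List, Sequence


def hybrid_features(structure_based: Sequence[float],
                    ligand_based: Sequence[float]) -> List[float]:
    """Concatenate pose-dependent and pose-independent features into one vector."""
    return list(structure_based) + list(ligand_based)


# Hypothetical toy values, not taken from the paper or its repository.
# Structure-based features (e.g. protein-ligand contact counts) depend on the pose.
crystal_pose_features = [12.0, 3.0, 7.0]
docked_pose_features = [10.0, 4.0, 6.0]

# Ligand-based features (e.g. molecular weight, rotatable-bond count) are
# assumed here to depend only on the ligand, so both poses share them.
ligand_features = [350.4, 5.0]

x_crystal = hybrid_features(crystal_pose_features, ligand_features)
x_docked = hybrid_features(docked_pose_features, ligand_features)

# Indices at which the two hybrid vectors differ: only the structure-based block.
n_struct = len(crystal_pose_features)
changed = [i for i, (a, b) in enumerate(zip(x_crystal, x_docked)) if a != b]
print(changed)  # [0, 1, 2]
```

Under these assumptions, the ligand-based block of the input is identical for both poses, which may be one reason such features help when only docked poses are available. Any regression model could then be trained on the concatenated vectors.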

Code and data to reproduce our results are available from https://github.com/oxpig/learning-from-docked-poses.

Keywords

Machine Learning Predictions

protein-ligand binding affinity prediction

Ligand docking

Supplementary materials

Title: learning from docks SI

Metrics

Views: 2,783

Downloads: 801

License

The content is available under CC BY 4.0

Funding

Engineering and Physical Sciences Research Council

EP/G03706X/1

Systems Biology Doctoral Training Centre

https://app.dimensions.ai/details/grant/grant.3559983

Author’s competing interest statement

no conflict of interest
