Learning from the Ligand: Using Ligand-Based Features to Improve Binding Affinity Prediction

Fergus Boyles; Charlotte M Deane; Garrett Morris

doi:10.26434/chemrxiv.8174525.v1

Theoretical and Computational Chemistry

23 May 2019, Version 1

Working Paper

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Machine learning scoring functions for protein-ligand binding affinity prediction have been found to consistently outperform classical scoring functions. Structure-based scoring functions for universal affinity prediction typically use features describing interactions derived from the protein-ligand complex, with limited information about the chemical or topological properties of the ligand itself. We demonstrate that the performance of machine learning scoring functions is consistently improved by the inclusion of diverse ligand-based features. For example, a Random Forest combining the features of RF-Score v3 with RDKit molecular descriptors achieved Pearson correlation coefficients of up to 0.831, 0.785, and 0.821 on the PDBbind 2007, 2013, and 2016 core sets respectively, compared to 0.790, 0.737, and 0.797 when using the features of RF-Score v3 alone. Excluding proteins and/or ligands that are similar to those in the test sets from the training set has a significant effect on scoring function performance, but does not remove the predictive power of ligand-based features. Furthermore, a Random Forest using only ligand-based features is predictive at a level similar to classical scoring functions, and it appears to be predicting the mean binding affinity of a ligand for its protein targets.
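To make the evaluation described above concrete, the following is a minimal sketch, not the authors' code, of the two generic operations the abstract relies on: joining structure-based and ligand-based feature vectors for each complex, and scoring predictions with the Pearson correlation coefficient. The function names `combine_features` and `pearson_r` are hypothetical, and the sketch uses only the Python standard library rather than RF-Score, RDKit, or a Random Forest implementation.

```python
import math


def combine_features(structure_features, ligand_features):
    """Concatenate structure-based and ligand-based feature vectors per complex.

    Hypothetical helper: each argument is a list with one feature vector
    per protein-ligand complex, in the same order.
    """
    if len(structure_features) != len(ligand_features):
        raise ValueError("both feature sets must describe the same complexes")
    return [list(s) + list(l) for s, l in zip(structure_features, ligand_features)]


def pearson_r(predicted, measured):
    """Pearson correlation coefficient between predicted and measured affinities."""
    if len(predicted) != len(measured) or len(predicted) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    n = len(predicted)
    mean_p = sum(predicted) / n
    mean_m = sum(measured) / n
    # Covariance and variances are left unnormalised; the 1/n factors cancel.
    cov = sum((p - mean_p) * (m - mean_m) for p, m in zip(predicted, measured))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_m = sum((m - mean_m) ** 2 for m in measured)
    if var_p == 0 or var_m == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_p * var_m)
```

In a setup like the one the abstract describes, the combined vectors would be the input to a regression model, and `pearson_r` would be applied to that model's predictions on a held-out test set.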

Keywords

machine Learning Predictions

Scoring Function

Random Forest Approach

RDKit

molecular descriptors

comparison of methods

Supplementary materials

suppinfo

Now Published

Learning from the ligand: using ligand-based features to improve binding affinity prediction

Fergus Boyles, Charlotte M Deane, Garrett M Morris

Journal article

Bioinformatics

Online publication date: Aug 26, 2019

Metrics

Views: 3,392

Downloads: 800

License

The content is available under CC BY 4.0

Funding

EPSRC EP/G03706X/1

Author’s competing interest statement

no conflict of interest
