Practically significant method comparison protocols for machine learning in small molecule drug discovery.

07 November 2024, Version 2
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Machine Learning (ML) methods that relate molecular structure to properties are frequently proposed as in-silico surrogates for expensive or time-consuming experiments. In small molecule drug discovery, such methods inform high-stakes decisions like compound synthesis and in-vivo studies. This application lies at the intersection of multiple scientific disciplines. When comparing new ML methods to baseline or state-of-the-art approaches, statistically rigorous method comparison protocols and domain-appropriate performance metrics are essential to ensure replicability and ultimately the adoption of ML in small molecule drug discovery. This paper proposes a set of guidelines to incentivize rigorous and domain-appropriate techniques for method comparison tailored to small molecule property modeling. These guidelines, accompanied by annotated examples and open-source software tools, lay a foundation for robust ML benchmarking and thus the development of more impactful methods.

Keywords

Method Comparison
Practical Significance
Small Molecules
Performance Metrics
Machine Learning
Replicability
Statistical Testing
Cross Validation

Comments

Comment number 2, Jackson Burns: Dec 05, 2024, 16:29

First, thanks for putting this together. Long overdue, and I am excited to move away from the dreaded bold table. I'm new to rigorous stats-world, so please forgive me if the below is totally off-base. My question is related to the suggestion that repeated random sampling is undesirable. I prefer this method since (I believe) it rigorously permits parametric testing for comparisons and because it allows using more advanced splitting methods (fingerprint-based clustering and partitioning, for example) without having to worry about rigorously 'striping' through the data.

From section 3.1.2 (v2): "Commonly used alternatives to CV like bootstrapping and repeated random splits of the data have also been shown to result in strong dependency between samples and are generally not recommended [13]." Reference 13 is Bates, S., Hastie, T. & Tibshirani, R. Cross-validation: What does it estimate and how well does it do it? Journal of the American Statistical Association 119, 1434–1445 (2023). URL http://dx.doi.org/10.1080/01621459.2023.2197686

(1) Where in this paper is this claim? (2) I find it unintuitive that repeated random splits would result in strong dependency, especially given that the suggested Repeated CV is very similar. Repeated random sampling is basically just Repeated CV (5x2) but without the x2 part (?).
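For concreteness, a minimal sketch of the two resampling schemes under discussion, assuming scikit-learn-style estimators and a toy regression dataset (the models, dataset, and scoring metric are illustrative placeholders, not the paper's tooling):

```python
# Contrast repeated 5x2 CV with repeated random ("Monte Carlo") splits for a
# paired method comparison. Dataset, models, and metric are placeholders.
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import Ridge
from sklearn.model_selection import RepeatedKFold, ShuffleSplit, cross_val_score

X, y = make_regression(n_samples=500, n_features=50, noise=10.0, random_state=0)

baseline = Ridge()
candidate = RandomForestRegressor(n_estimators=100, random_state=0)

# Repeated 5x2 CV: 5 repeats of a 2-fold split; within each repeat the two
# test sets are disjoint.
cv_5x2 = RepeatedKFold(n_splits=2, n_repeats=5, random_state=0)

# Repeated random splits: 10 independent 50/50 splits; test sets overlap
# across repeats, so the per-split estimates are correlated.
cv_random = ShuffleSplit(n_splits=10, test_size=0.5, random_state=0)

for name, cv in [("repeated 5x2 CV", cv_5x2), ("repeated random splits", cv_random)]:
    base_scores = cross_val_score(baseline, X, y, cv=cv,
                                  scoring="neg_mean_absolute_error")
    cand_scores = cross_val_score(candidate, X, y, cv=cv,
                                  scoring="neg_mean_absolute_error")
    diffs = cand_scores - base_scores  # paired per-split differences
    print(f"{name}: mean improvement = {diffs.mean():.2f} "
          f"+/- {diffs.std(ddof=1):.2f}")
```

The paired per-split differences are what would feed a downstream statistical test; the practical distinction is how much the test sets overlap across repeats under each scheme.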

Response, Jeremy Ash: Dec 17, 2024, 02:34

Response here: https://github.com/polaris-hub/polaris-method-comparison/discussions/9

Comment number 1, Francois Berenger: Nov 09, 2024, 19:32

Hello, I would be interested to know when this gets published somewhere.

Response, Cas Wognum: Nov 10, 2024, 15:59

Hey Francois, thanks for reaching out! We're sharing this work as a preprint to seek feedback from the community on the proposed guidelines. Afterwards, we intend to submit the paper for publication in a peer-reviewed journal early next year. The best way to share your feedback would be as a GitHub Discussion here: https://github.com/polaris-hub/polaris-method-comparison/discussions. I hope that helps!