Investigating the Reliability and Interpretability of Machine Learning Frameworks for Chemical Retrosynthesis

Friedrich Hastedt; Rowan M. Bailey; Klaus  Hellgardt; Sophia N. Yaliraki; Ehecatl Antonio del Rio Chanona; Dongda Zhang

doi:10.26434/chemrxiv-2024-qdgnv-v3

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

Investigating the Reliability and Interpretability of Machine Learning Frameworks for Chemical Retrosynthesis

12 February 2024, Version 3

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Machine learning models for chemical retrosynthesis have attracted substantial interest in recent years. Unaddressed challenges, particularly the absence of robust evaluation metrics for performance comparison, and the lack of black-box interpretability, obscure model limitations and impede progress in the field. We present an automated benchmarking pipeline designed for effective model performance comparisons. With an emphasis on user-friendly design, we aim to streamline accessibility and facilitate utilisation within the research community. Additionally, we suggest and perform a new interpretability study to uncover the degree of chemical understanding acquired by retrosynthesis models. Our results reveal that frameworks based on chemical reaction rules yield the most diverse, chemically valid, and feasible reactions, whereas purely data-driven frameworks suffer from unfeasible and invalid predictions. The interpretability study emphasises that incorporating reaction rules not only enhances model performance but also improves interpretability. For simple molecules, we demonstrate that Graph Neural Networks identify relevant functional groups within the product molecule, providing thermodynamic stabilisation over the reactant precursors. In contrast, the popular Transformer fails to identify such crucial stabilisation. As the molecule and reaction mechanism grow more complex, both data-driven models propose unfeasible disconnections without offering a chemical rationale. We stress the importance of incorporating chemically meaningful descriptors within deep-learning models. Our study provides valuable guidance for the future development of retrosynthesis frameworks.

Keywords

Chemical Retrosynthesis

Supplementary weblinks

Title

Description

Actions

Title

GitHub Repository

Description

The code repository for reproducing results and benchmarking retrosynthesis algorithms

Actions

View

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Feb 12, 2024 Version 3

Jan 19, 2024 Version 2

Jan 11, 2024 Version 1

Version Notes

Fixing typos with spelling in Figure 1b and SELFIES in Conclusion

Metrics

2,591

1,382

Views

Downloads

Citations

License

The content is available under CC BY 4.0

DOI

10.26434/chemrxiv-2024-qdgnv-v3

Funding

Engineering and Physical Sciences Research Council

EP/S023232/1

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Investigating the Reliability and Interpretability of Machine Learning Frameworks for Chemical Retrosynthesis

Authors

Abstract

Keywords

Supplementary weblinks

Comments

Version History

Version Notes

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share