A Bag of Tricks for Automated De Novo Design of Molecules with the Desired Properties: Application to EGFR Inhibitor Discovery

Maria Korshunova; Niles Huang; Stephen Capuzzi; Dmytro S. Radchenko; Olena Savych; Yuriy S. Moroz; Carrow Wells; Timothy M. Willson; Alexander Tropsha; Olexandr Isayev

doi:10.26434/chemrxiv.14045072.v1

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

A Bag of Tricks for Automated De Novo Design of Molecules with the Desired Properties: Application to EGFR Inhibitor Discovery

24 February 2021, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Deep generative neural networks have been used increasingly in computational chemistry for de novo design of molecules with desired properties. Many deep learning approaches employ reinforcement learning for optimizing the target properties of the generated molecules. However, the success of this approach is often hampered by the problem of sparse rewards as the majority of the generated molecules are expectedly predicted as inactives. We propose several technical innovations to address this problem and improve the balance between exploration and exploitation modes in reinforcement learning. In a proof-of-concept study, we demonstrate the application of the deep generative recurrent neural network enhanced by several novel technical tricks to designing experimentally validated potent inhibitors of the epidermal growth factor (EGFR). The proposed technical solutions are expected to substantially improve the success rate of finding novel bioactive compounds for specific biological targets using generative and reinforcement learning approaches.

Keywords

de novo design

generative models

Molecular Design

Generative models for molecules

drug discovery

Machine Learning

Supplementary materials

Title

Description

Actions

Title

egfr supplementary

Description

Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Feb 24, 2021 Version 1

Metrics

3,548

1,224

Views

Downloads

License

The content is available under CC BY NC ND 4.0

DOI

10.26434/chemrxiv.14045072.v1

Funding

Directorate for Mathematical & Physical Sciences

2041108

D3SC: CDS&E: Collaborative Research: Development and application of accurate, transferable and extensible deep neural network potentials for molecules and reactions

https://app.dimensions.ai/details/grant/grant.9383908

Directorate for Mathematical & Physical Sciences

1802789

D3SC: CDS&E: Collaborative Research: Development and application of accurate, transferable and extensible deep neural network potentials for molecules and reactions

https://app.dimensions.ai/details/grant/grant.7671608

NIH 1U01CA207160

ONR N00014-16-1-2311

OAC-1818253

Author’s competing interest statement

N/A

A Bag of Tricks for Automated De Novo Design of Molecules with the Desired Properties: Application to EGFR Inhibitor Discovery

Authors

Abstract

Keywords

Supplementary materials

Comments

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Share