DrugSynthMC: an atom based generation of drug-like molecules with Monte Carlo Search

Milo Roucairol; Alexios Georgiou; Tristan Cazenave; Filippo Prischi; Olivier E. Pardo

doi:10.26434/chemrxiv-2024-l2969

Biological and Medicinal Chemistry

Search within Biological and Medicinal Chemistry

DrugSynthMC: an atom based generation of drug-like molecules with Monte Carlo Search

29 May 2024, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

A growing number of Deep Learning (DL) methodologies have recently been developed to design novel compounds and expand the chemical space within virtual libraries. Most of these Neural Network approaches design molecules to specifically bind a target, based on its structural information and/or knowledge of previously identified binders. Fewer attempts have been made to develop approaches for de novo design of virtual libraries, as synthesizability of generated molecules remains a challenge. In this work, we developed a new Monte Carlo Search (MCS) algorithm, DrugSynthMC (Drug Synthetise using Monte Carlo), in conjunction with DL and statistical-based priors to generate thousands of interpretable chemical structures and novel drug-like molecules per second. DrugSynthMC produces drug-like compounds using an atom-based search model that builds molecules as SMILES, character by character. Designed molecules follow Lipinski’s “rule of 5”, show a high proportion of predicted-to-be synthesisable compounds and efficiently expand the chemical space within the libraries, without reliance on training datasets, synthesizability metrics or enforcing during SMILES generation. Our approach can function with or without an underlying Neural Network and is thus easily explainable and versatile. This ease in drug-like molecule generation allows for future integration of score functions aimed at different target- or job -oriented goals. Thus, DrugSynthMC is expected to enable the functional assessment of large compound libraries covering an extensive novel chemical space, overcoming the limitations of existing drug collections. The software is available at https://github.com/RoucairolMilo/DrugSynthMC

Keywords

Supplementary weblinks

Title

Description

Actions

Title

DrugSynthMC

Description

DrugSynthMC software is available at https://github.com/RoucairolMilo/DrugSynthMC

Actions

View

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

May 29, 2024 Version 1

Metrics

621

235

Views

Downloads

Citations

License

The content is available under CC BY NC ND 4.0

DOI

10.26434/chemrxiv-2024-l2969

Funding

Agence nationale de la recherche

ANR19- P3IA-0001

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

DrugSynthMC: an atom based generation of drug-like molecules with Monte Carlo Search

Authors

Abstract

Keywords

Supplementary weblinks

Comments

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share