Deep Generative Model for the Dual-Objective Inverse Design of Metal Complexes

Magnus Strandgaard; Trond Linjordet; Hannes Kneiding; Arron Burnage; Ainara Nova; Jan Halborg Jensen; David Balcells

doi:10.26434/chemrxiv-2024-mzs7b

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

Deep Generative Model for the Dual-Objective Inverse Design of Metal Complexes

29 May 2024, Version 1

This is not the most recent version. There is a

newer version

of this content available

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Deep generative models yielding transition metal complexes (TMCs) remain scarce despite the key role of these compounds in industrial catalytic processes, anticancer therapies, and energy transformations. Compared to drug discovery within the organic molecular space, TMCs pose further challenges including the encoding of chemical bonds of higher complexity and the optimization of multiple properties, in a context in which synthesizability is affected by additional, complex factors. In this work, we developed a junction tree variational autoencoder (JT-VAE) model for the generation of metal ligands. After implementing a SMILES-based encoding of the metal–ligand bonds, the model was trained with the tmQMg-L ligand library, allowing for the random generation of thousands of monodentate and bidentate ligands with full validity and high novelty. The generated ligands were labeled with two target properties of the associated [IrL4]+ and [IrL2]+ homoleptic TMCs; namely the HOMO-LUMO gap (ϵ) and the metal charge (qIr), both computed at a DFT level. This data was used to implement a conditional JT-VAE model generating ligands from a prompt, with the single or dual objective of optimizing either one or both properties in Y = (ϵ, qIr). Conditional ligand generation was able to navigate both central and extreme regions of this bidimensional property space, allowing for chemical interpretation based on the step-wise analysis of the decoded optimization trajectories.

Keywords

inverse design

deep learning

variational autoencoders

metal ligands

transition metal complexes

generative models

Supplementary materials

Title

Description

Actions

Title

Supporting Information

Description

The supporting information provides further details on library versions, curation of the training SMILES, ligand encodings, coordination environments, synthetic accessibility, latent space analysis, Cartesian coordinates generation from SMILES, DFT calculations, outlier analysis, and JT-VAE model details for both the unconditional and conditional generative tasks.

Actions

Supplementary weblinks

Title

Description

Actions

Title

Repository

Description

Data and code repository.

Actions

View

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Jan 27, 2025 Version 2

May 29, 2024 Version 1

Metrics

1,374

673

Views

Downloads

License

The content is available under CC BY NC ND 4.0

DOI

10.26434/chemrxiv-2024-mzs7b

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Deep Generative Model for the Dual-Objective Inverse Design of Metal Complexes

Authors

Abstract

Keywords

Supplementary materials

Supplementary weblinks

Comments

Version History

Metrics

License

DOI

Author’s competing interest statement

Ethics

Share