Deep Learning Metal Complex Properties with Natural Quantum Graphs

Hannes Kneiding; Ruslan Lukin; Lucas Lang; Simen Reine; Thomas Bondo Pedersen; Riccardo De Bin; David Balcells

doi:10.26434/chemrxiv-2022-fd43k-v2

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

Deep Learning Metal Complex Properties with Natural Quantum Graphs

09 November 2022, Version 2

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Machine learning can make a strong contribution to accelerating the discovery of transition metal complexes (TMC). These compounds will play a key role in the development of new technologies for which there is an urgent need, including the production of green hydrogen from renewable sources. Despite the recent developments in machine learning for drug discovery and organic chemistry in general, the application of these methods to TMCs remains challenged by their higher complexity and the limited availability of large datasets. In this work, we report a representation for deep graph learning on TMCs – the natural quantum graph (NatQG), which leverages the electronic structure data available from natural bond orbital (NBO) analysis. This data was used to define both the topology and the information expressed by the NatQG graphs. At the topology level, two different NatQG flavors were developed: u-NatQG, with undirected edges, and d-NatQG, with edges directed along donor → acceptor orbital interactions. At the information level, the node and edge attribute vectors of both graphs contain NBO data, including natural charges and bond orders. The NatQG graphs were used to develop graph neural networks (GNNs) for the prediction of the quantum properties underlying the structure and reactivity of TMCs (e.g. HOMO-LUMO gap and polarizability). These models surpassed baselines based on traditional descriptors and performed at a level similar to, or higher than, state-of-the-art GNNs based on radial cutoffs. The results showed that the electronic structure information encoded by the models has a stronger impact on its accuracy than the geometric information. With the aim of benchmarking the GNNs, we also developed the transition metal quantum mechanics graph dataset (tmQMg), which provides the geometries, properties, and NatQG graphs of 60k TMCs.

Keywords

graph neural networks

transition metal complexes

Supplementary materials

Title

Description

Actions

Title

Supporting Information

Description

Further information on the statistics of the tmQMg dataset and its outliers. Technical details of the GNN models, the baseline representation, and the linear fitting of the atomic energies used to predict energy targets. The error metrics obtained with the training dataset, the Python libraries used to develop the HyDGL code, and the computational details of the tmQMg dataset are also provided.

Actions

Supplementary weblinks

Title

Description

Actions

Title

HyDGL code

Description

HyDGL is a Python parser for generating descriptive graphs from Natural Bond Orbital data ready for use in Graph Neural Networks.

Actions

View

Title

tmQMg dataset

Description

Repository for the tmQMg dataset files and analysis scripts.

Actions

View

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Now Published

Deep learning metal complex properties with natural quantum graphs

Hannes Kneiding, Ruslan Lukin, Lucas Lang, Simen Reine, Thomas Bondo Pedersen, Riccardo De Bin, David Balcells journal article

Digital Discovery , Volume 2, Issue 3

Online publication date: 2023

Version History

Nov 09, 2022 Version 2

Jun 28, 2022 Version 1

Version Notes

The predictions were extended to several other properties, including extensive energies. A baseline graph representation was included for benchmarking purposes. Further, the impacts of including geometric and electronic structure information were assessed and compared. Access to open code and data was also added.

Metrics

2,417

1,153

Views

Downloads

Citations

License

The content is available under CC BY NC ND 4.0

DOI

10.26434/chemrxiv-2022-fd43k-v2

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Deep Learning Metal Complex Properties with Natural Quantum Graphs

Authors

Abstract

Keywords

Supplementary materials

Supplementary weblinks

Comments

Now Published

Version History

Version Notes

Metrics

License

DOI

Author’s competing interest statement

Ethics

Share