Exploring Deep Learning for Metalloporphyrins: Databases, Molecular Representations, and Model Architectures

An Su; Chengwei Zhang; Yuanbin She; Yun-Fang Yang

doi:10.26434/chemrxiv-2022-sq6dg-v2

Theoretical and Computational Chemistry

11 October 2022, Version 2

Working Paper

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Metalloporphyrins have been studied as biomimetic catalysts for more than 120 years, and the large body of data accumulated over that time provides a solid foundation for deep learning to discover chemical trends and structure-function relationships. In this study, the key components of deep learning for metalloporphyrins, namely databases, molecular representations, and model architectures, were systematically investigated. A protocol for constructing canonical SMILES for metalloporphyrins was proposed and then used to represent the two-dimensional structures of over 10,000 metalloporphyrins in an existing computational database. Several state-of-the-art chemical deep learning models, including graph neural network-based and natural language processing-based models, were then employed to predict the energy gaps of the metalloporphyrins. Two models showed satisfactory predictive performance (R² > 0.94) with canonical SMILES as the only source of structural information. In addition, an unsupervised visualization algorithm was used to interpret the molecular features learned by the deep learning models.
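The abstract reports predictive performance as a coefficient of determination. As a minimal sketch of how such a score is commonly computed (the preprint's own evaluation code is not shown on this page, so the function name `r_squared` and the energy-gap values below are assumptions made purely for illustration):

```python
from typing import Sequence


def r_squared(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_true = sum(y_true) / len(y_true)
    # Residual sum of squares: error left over after prediction.
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Total sum of squares: spread of the reference values around their mean.
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when all reference values are identical")
    return 1.0 - ss_res / ss_tot


# Hypothetical energy gaps in eV, invented for illustration only;
# they are not taken from the metalloporphyrin database.
reference_gaps = [2.0, 2.5, 3.0, 3.5]
predicted_gaps = [2.1, 2.4, 3.1, 3.4]
score = r_squared(reference_gaps, predicted_gaps)
```

With these made-up values the score is about 0.968. A value of 1 means the predictions reproduce the reference values exactly, while a value of 0 means they do no better than always predicting the mean.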

Keywords

Metalloporphyrin

Database

Molecular representation

Deep learning

Property prediction

Supplementary materials

Supporting Information: Supplementary figures, tables, and discussions.

Now Published

Exploring Deep Learning for Metalloporphyrins: Databases, Molecular Representations, and Model Architectures

An Su, Chengwei Zhang, Yuan-Bin She, Yun-Fang Yang

Journal article

Catalysts, Volume 12, Issue 11

Online publication date: Nov 21, 2022

Version History

Oct 11, 2022 Version 2

Sep 21, 2022 Version 1

Version Notes

1. New references were added to the introduction.

2. Table of contents was added.

3. Minor format changes were made.

Metrics

Views: 1,686

Downloads: 429

License

The content is available under CC BY NC ND 4.0

DOI

10.26434/chemrxiv-2022-sq6dg-v2

Funding

National Natural Science Foundation of China: 22108252

National Natural Science Foundation of China: 21978272

Fundamental Research Funds for the Provincial Universities of Zhejiang: RF-B2020006

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content
