Abstract
In this work, we introduce MOFTransformer, a multi-modal Transformer encoder pre-trained with 1 million hypothetical MOFs. The multi-modal model uses integrated atom-based graph and energy-grid embeddings to capture the local and global features of MOFs, respectively. By fine-tuning the pre-trained model with small datasets (from 5,000 to 20,000 MOFs), our model outperforms all other machine learning models across various properties, including gas adsorption, diffusion, electronic properties, and even text-mined data. Beyond its universal transfer-learning capabilities, MOFTransformer generates chemical insight by analyzing feature importance derived from attention scores within its self-attention layers. As such, this model can serve as a bedrock platform for MOF researchers who seek to develop new machine learning models for their work.
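The sketch below illustrates, in minimal PyTorch, the multi-modal idea described in the abstract: atom-based graph embeddings (local features) and energy-grid embeddings (global features) are combined into a single token sequence and processed by a shared Transformer encoder, with the pooled representation feeding a task head that would be swapped out during fine-tuning. This is not the authors' implementation; all class, argument, and dimension names are hypothetical.

```python
# Illustrative sketch of a multi-modal MOF encoder (hypothetical names, not
# the MOFTransformer codebase): graph-atom tokens capture local chemistry,
# energy-grid patch tokens capture global pore/energy structure, and a [CLS]
# token is used for property prediction.
import torch
import torch.nn as nn


class MultiModalMOFEncoder(nn.Module):
    def __init__(self, dim=256, n_heads=8, n_layers=6,
                 n_atom_types=100, grid_patch_dim=512):
        super().__init__()
        self.atom_embed = nn.Embedding(n_atom_types, dim)   # local branch: atom identities
        self.grid_embed = nn.Linear(grid_patch_dim, dim)     # global branch: energy-grid patches
        self.cls_token = nn.Parameter(torch.zeros(1, 1, dim))
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=n_layers)
        self.head = nn.Linear(dim, 1)  # replaced per target property during fine-tuning

    def forward(self, atom_ids, grid_patches):
        # atom_ids: (batch, n_atoms) long tensor
        # grid_patches: (batch, n_patches, grid_patch_dim) float tensor
        tokens = torch.cat([
            self.cls_token.expand(atom_ids.size(0), -1, -1),
            self.atom_embed(atom_ids),
            self.grid_embed(grid_patches),
        ], dim=1)
        encoded = self.encoder(tokens)
        return self.head(encoded[:, 0])  # predict from the [CLS] token
```

In a transfer-learning workflow of this kind, the encoder weights would be pre-trained on the large hypothetical-MOF set and only `head` (plus a light fine-tuning of the encoder) adapted to each small downstream dataset.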
Supplementary materials
Supplementary Information: Supplementary Notes 1-5, Figures 1-12, Table 1, and references.