Efficient Training of Neural Network Potentials for Chemical and Enzymatic Reactions by Continual Learning

Yao-Kun Lei; Kiyoshi Yagi; Yuji Sugita

doi:10.26434/chemrxiv-2024-xkxd5

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

Efficient Training of Neural Network Potentials for Chemical and Enzymatic Reactions by Continual Learning

18 October 2024, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

The machine learning (ML) method has emerged as an efficient surrogate for high-level electronic structure theory, offering precision and computational efficiency. However, the construction of a general force field remains challenging due to the vast conformational and chemical space. Training data sets typically cover only a limited region of this space, resulting in poor extrapolation performance. Traditional strategies inadequately address this problem by training models from scratch using both old and new datasets. In addition, model transferability is crucial for general force field construction. Existing ML force fields, designed for closed systems with no external environmental potential, exhibit limited transferability to complex condensed phase systems such as enzymatic reactions, resulting in inferior performance and high memory costs. Our ML/MM model based on the Taylor expansion of the electrostatic operator showed high transferability between reactions in several simple solvents. In this work, we extend the strategy to enzymatic reactions to explore transferability between more complex heterogeneous environments. In addition, we also apply continual learning strategies based on memory datasets to enable autonomous and on-the-fly training on a continuous stream of new data. By combining these two methods, we can construct a more general force field more efficiently.

Supplementary materials

Title

Description

Actions

Title

Supplemental Information

Description

Tables S1-S5 and Figures S1-S6

Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Oct 18, 2024 Version 1

Metrics

1,295

317

Views

Downloads

Citations

License

The content is available under CC BY NC ND 4.0

DOI

10.26434/chemrxiv-2024-xkxd5

Funding

RIKEN

Pioneering Project "Biology of Intracellular Environments"

RIKEN

TRIP initiative (RIKEN Quantum)

MEXT

JP19H05645, JP21H05249, JP22H04761

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Efficient Training of Neural Network Potentials for Chemical and Enzymatic Reactions by Continual Learning

Authors

Abstract

Supplementary materials

Comments

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share