Abstract
Leveraging the chemical data available in legacy formats such as publications and patents is a significant challenge for the community. Automated reaction mining offers a promising route to unlock this knowledge in a learnable digital form and thereby help expedite materials and reaction discovery. However, existing reaction mining toolkits are limited to a single input modality (text or images) and cannot effectively integrate heterogeneous data scattered across modalities, including text, tables, and figures. In this work, we go beyond single input modalities and explore multimodal large language models (MLLMs) for the analysis of diverse data inputs in automated electrosynthesis reaction mining. We compiled a test dataset of 65 articles and used it to benchmark five prominent MLLMs on two critical tasks: (i) reaction diagram parsing and (ii) resolving cross-modality data interdependencies. The frontrunner MLLM achieved ≥ 96% accuracy on both tasks through the strategic integration of single-shot visual prompts and image pre-processing techniques. We integrate this capability into a toolkit named MERMES (Multimodal Reaction Mining pipeline for ElectroSynthesis), an end-to-end MLLM-powered pipeline that combines article retrieval, information extraction, and multimodal analysis to streamline and automate knowledge extraction. This work lays the groundwork for the broader use of MLLMs to accelerate the digitization of chemistry knowledge for data-driven research.
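To illustrate the kind of step the abstract describes, the following is a minimal sketch of pairing image pre-processing with a single-shot visual prompt for reaction diagram parsing. It is not the MERMES implementation: the function names (preprocess_diagram, build_messages), the upscaling heuristic, and the OpenAI-style chat payload layout are all assumptions made for illustration.

```python
# Hypothetical sketch, not the authors' code: pre-process a reaction diagram
# image, then assemble a single-shot visual prompt (one worked example plus
# the query image) in an OpenAI-style chat message payload.
import base64
import io

from PIL import Image  # Pillow, assumed available for image pre-processing


def preprocess_diagram(path: str, min_width: int = 1024) -> str:
    """Upscale a small diagram so fine text survives model-side downsampling,
    then return it as a base64-encoded PNG string for the MLLM request."""
    img = Image.open(path).convert("RGB")
    if img.width < min_width:
        scale = min_width / img.width
        img = img.resize((min_width, int(img.height * scale)))
    buf = io.BytesIO()
    img.save(buf, format="PNG")
    return base64.b64encode(buf.getvalue()).decode("ascii")


def build_messages(example_b64: str, example_answer: str, query_b64: str) -> list:
    """Single-shot visual prompt: show one annotated diagram and its expected
    structured answer before asking the model to parse the new diagram."""
    return [
        {"role": "user", "content": [
            {"type": "text",
             "text": "Parse the reaction diagram into the JSON schema shown in the example."},
            {"type": "image_url",
             "image_url": {"url": f"data:image/png;base64,{example_b64}"}},
            {"type": "text", "text": example_answer},  # the worked example (the "single shot")
            {"type": "image_url",
             "image_url": {"url": f"data:image/png;base64,{query_b64}"}},
        ]}
    ]
```

In such a setup, the returned message list would be passed to whichever MLLM endpoint is being benchmarked; the pre-processing and the in-context example are the two levers the abstract credits with the accuracy gain.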
Supplementary materials
Supplementary Information, including: 1) prompts, 2) error analysis, 3) list of DOIs, and 4) evaluation tables.