An extensible platform for enabling artificial intelligence guided design of catalysts and materials

13 February 2023, Version 2
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Advances in artificial intelligence (AI), machine learning (ML), and automated experimentation are poised to vastly accelerate and reshape polymer science research. The difficulties of experimental data and polymer structure representation form a significant barrier in developing foundational AI models in polymer chemistry. These issues are further compounded by a lack of accessible and extensible software tools for experimental researchers to effectively leverage their historical data for AI development. To address these difficulties, we developed an extensible domain specific language, termed Chemical Markdown Language (CMDL), to enable flexible and consistent representation of disparate experiment types. CMDL can be extended to cover new data types and provides built-in support for the graph representation of polymers and automated experimentation platforms such as continuous flow reactors facilitating their ingestion into AI/ML pipelines. CMDL enabled seamless use of historical experimental data to train new AI models for catalyst and materials design. Here, the utility of this approach was first demonstrated through the successful synthesis and experimental validation of novel AI-generated ring-opening polymerization catalysts. Next, we derived a first-of-kind generative model for block and statistical co-polymer structures from their CMDL graph representations. This model preserved critical functional groups within the polymer structure, allowing them to be readily validated experimentally. These results reveal how the versatility of CMDL facilitates translation of historical experimental data into meaningful predictive and generative models which produce experimentally actionable output.

Keywords

accelerated discovery
polymer informatics
artificial intelligence

Supplementary materials

Title
Description
Actions
Title
Supporting Information
Description
Supporting information and figures for manuscript.
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.