Highly Efficient Crystallization Studies through Machine Learning and Automation

04 November 2024, Version 2
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Crystallization is an important process in a broad range of industries, though studies on this topic remain complicated. Recently, machine learning has been applied to resolve complex issues in chemistry and material science. Here we present a machine learning model integrated into a robotic platform to effectively propose and automatically perform crystallization experiments for organic small molecules. The model was pretrained on around 140000 simulated data generated by a kinetic model, and fine-tuned by over 7000 experimental data obtained on the automated workstation. The improved prediction accuracy and working efficiency of our integrated platform were presented in case studies compared with the traditional approach by humans. A feature contribution analysis demonstrates that this model provides a holistic data-based perspective of promoting and inhibiting influences to crystallization. This work thereby demonstrates the feasibility of applying machine learning techniques to solid-state studies to reduce cost, boost efficiency and deepen understanding.

Keywords

machine learning
crystallization propensity
organic small molecule crystallization
automated workstation

Supplementary materials

Title
Description
Actions
Title
SI Experimental Data Collection
Description
Partial experimental data for model training.
Actions
Title
SI Comparative Studies
Description
Detailed comparative study data.
Actions
Title
Supplimentary Information
Description
Detail information of experimental methods, workstation technical specs, the results of comparative experiments, the details of KC Model and Xtal2 Model, the feature contribution analyses for evaporation crystallization and comparative case studies.
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.