Graph Neural Networks for CO2 Solubility Predictions in Deep Eutectic Solvents

02 April 2024, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Deep Eutectic Solvents (DESs) are a promising class of solvents for CO2 capture. DESs are complex mixtures that can be designed to optimize CO2 solubility and overall capture process efficiency. However, the vast design landscape of DES mixtures makes experimental investigation prohibitive; as such, there is a need for computational models that can quickly and efficiently navigate the design space and inform data collection efforts. In this work, we propose Graph Neural Network (GNN) models for predicting CO2 solubility directly from the molecular structure of its constituents. The GNN leverages a mixture graph representation that captures the molecular structure of the DES components as well as their intermolecular interactions. We compare the GNN framework against alternative architectures (neural networks, graph convolution networks, and random forests) and data representations (molecular fingerprints, sigma profiles, pseudo-component graphs). We show that the proposed approach offers superior predictive performance; specifically, we show that solubility can be predicted reliably directly from molecular structure (without the need of using sigma profiles as proposed in previous studies). This result is important, as obtaining sigma profiles requires expensive density functional theory computations. We also explored the ability of GNNs to predict solubility for new DES mixtures and operating conditions. We found that the model extrapolates across temperature, pressure, and molar ratios reliably. However, we also found deficiencies in the ability of the model to predict solubility for DES mixtures not included in the training set; we show that this is due to an inherent lack of chemical diversity in datasets available in the literature. The proposed computational capabilities can help navigate the design space of DES and inform data collection efforts. Our models, data, and benchmarks are shared as Python code implemented in Jupyter notebooks.

Keywords

carbon capture
machine learning
deep eutectic solvents

Supplementary materials

Title
Description
Actions
Title
Supplementary Information
Description
Contains details on computational framework
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.