Gaussian Process Regression (GPR) Method for the Prediction of Rate Coefficients of Gas-phase Reactions in Chemical Ionization Mass Spectrometry

15 December 2022, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Reaction kinetics of chemical ionization mass spectrometry (CI-MS) based ion-molecule reactions is an important component in the quantification of trace-level volatile organic compounds (VOCs). The rate coefficients of such CI-MS reactions are predicted using the Gaussian process regression (GPR) machine learning method from the dipole moment, polarizability, and molecular weight of the molecules, mitigating experimental complexity in CI-MS rate coefficient estimation. GPR can make predictions combining prior knowledge (kernel function) which is considered the heart of the GPR model and provide uncertainty measures over predictions. A suitable kernel combination with proper tuning of parameters can make the Gaussian process more robust and powerful. Various kernel combinations, such as kernel addition and multiplication, are tested in the GPR prediction of rates. A blend of radial basis function (RBF), white noise, and squared exponential kernel performs better, and the predicted rates are in close agreement with the experimental rates. GPR provides an alternative to the capture collision rates and can be useful when there are no experimental data available and/or the available data contain large uncertainty in the rate coefficients.

Keywords

Gaussian process regression
Rate coefficients
Chemical ionization mass spectrometry
Volatile organic compounds
Machine learning

Supplementary materials

Title
Description
Actions
Title
data-exp
Description
Experimental data used for the prediction of rates.
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.