Times are changing but order matters: Transferable prediction of small molecule liquid chromatography retention times

Fleming Kretschmer; Eva-Maria Harrieder; Michael Witting; Sebastian Böcker

doi:10.26434/chemrxiv-2024-wd5j8

Analytical Chemistry


23 December 2024, Version 1

Working Paper


This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Thousands of publications on the prediction of small molecule retention times have been published over the last decades. The ultimate goal is, without doubt, the transferable prediction of retention times: we want to train a model on a set of compounds from one dataset and then use it to predict retention times for a different set of compounds from another dataset. Unfortunately, retention times may change massively, even under nominally identical chromatographic conditions. Retention order is much better preserved, yet even the retention order of compounds may change if chromatographic conditions vary. Here, we systematically study which chromatographic conditions result in notable changes in retention order. We then present a machine learning model that predicts retention order or, more precisely, a retention order index, taking chromatographic conditions into account. Finally, we show how to map the retention order index to retention times. Disentangling these two tasks finally enables retention time prediction across chromatographic conditions and compound classes.
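The two ideas in the abstract, comparing retention order between datasets and mapping an order index back to retention times, can be illustrated with a minimal sketch. This is not the authors' model or code: the function names `order_agreement` and `map_index_to_rt` are hypothetical, the order index values are assumed to be given, and piecewise-linear interpolation between anchor compounds is a stand-in chosen for illustration, not necessarily the mapping used in the paper.

```python
from bisect import bisect_left
from itertools import combinations


def order_agreement(rt_a, rt_b):
    """Fraction of shared compound pairs that elute in the same order
    in both datasets (hypothetical helper, not from the paper).

    rt_a, rt_b: dicts mapping compound name -> retention time.
    """
    shared = sorted(set(rt_a) & set(rt_b))
    concordant = total = 0
    for x, y in combinations(shared, 2):
        diff_a = rt_a[x] - rt_a[y]
        diff_b = rt_b[x] - rt_b[y]
        if diff_a == 0 or diff_b == 0:
            continue  # co-eluting pairs carry no order information
        total += 1
        if (diff_a > 0) == (diff_b > 0):
            concordant += 1
    return concordant / total if total else float("nan")


def map_index_to_rt(anchors, order_index):
    """Map an order index to a retention time by piecewise-linear
    interpolation between anchor compounds (an assumed mapping).

    anchors: list of (order_index, retention_time) pairs measured on the
    target system; values outside the anchor range are clamped.
    """
    pts = sorted(anchors)
    xs = [p[0] for p in pts]
    ys = [p[1] for p in pts]
    if order_index <= xs[0]:
        return ys[0]
    if order_index >= xs[-1]:
        return ys[-1]
    i = bisect_left(xs, order_index)  # xs[i-1] < order_index <= xs[i]
    x0, x1 = xs[i - 1], xs[i]
    y0, y1 = ys[i - 1], ys[i]
    return y0 + (y1 - y0) * (order_index - x0) / (x1 - x0)


# Toy data, invented purely for illustration.
system_a = {"A": 1.0, "B": 2.0, "C": 3.0}
system_b = {"A": 2.5, "B": 4.0, "C": 3.5}  # B and C swap order here
agreement = order_agreement(system_a, system_b)
predicted_rt = map_index_to_rt([(0.0, 1.0), (10.0, 6.0)], 5.0)
```

In this toy example, two of the three compound pairs keep their order across the two systems, and an order index halfway between the two anchors maps to a retention time halfway between theirs. Separating the steps this way means the order predictor never has to model absolute times, which only enter through the anchors measured on the target system.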

Keywords

Retention Time Prediction

Metabolomics

Liquid Chromatography

Supplementary materials

Supplementary Table 2. List of RepoRT datasets used for retention order statistics and model evaluation

All datasets from RepoRT are listed, detailing in which evaluation scenario each dataset is used. Information on which datasets are missing important metadata (HSM and Tanaka parameters, pH, void volume estimate, column temperature, flow rate) is also provided. Datasets removed from evaluation following manual curation are specified.

Supplementary weblinks

Code for model training, evaluation and application

GitHub repository containing the code to train, evaluate and apply the two-step retention time prediction models.



Metrics

Views: 865

Downloads: 348

License

The content is available under CC BY NC 4.0


Funding

Deutsche Forschungsgemeinschaft: BO 1910/23

Deutsche Forschungsgemeinschaft: MW 4382/10-1

Ministry for Economics, Sciences and Digital Society of Thuringia: Framework ProDigital, DigLeben 5575/10-9

Author’s competing interest statement

S.B. is a cofounder of Bright Giant GmbH.

Ethics

The author(s) have declared that ethics committee/IRB approval is not relevant to this content.
