UNIQUE: A Framework for Uncertainty Quantification Benchmarking

30 August 2024, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Machine learning (ML) models have become key in decision-making for many disciplines, including drug discovery and medicinal chemistry. ML models are generally evaluated prior to their usage for high-stake decisions, such as compound synthesis or experimental testing. However, no ML model is robust and predictive in all real-world scenarios. Therefore, uncertainty quantification (UQ) in ML predictions has gained importance in recent years. Many investigations have focused on developing methodologies that provide accurate uncertainty estimates for ML-based predictions. Unfortunately, there is no UQ strategy that consistently provides robust estimates about model’s applicability on new samples. Depending on the dataset, prediction task, and algorithm, accurate uncertainty estimations might be unfeasible to obtain. Moreover, the optimum UQ metric also varies across applications, and previous investigations have shown a lack of consistency across benchmarks. Herein, the UNIQUE (UNcertaInty QUantification bEnchmarking) framework is introduced to facilitate the comparison of UQ strategies in ML-based predictions. This Python library unifies the benchmarking of multiple UQ metrics, including the calculation of non-standard UQ metrics (combining information from the dataset and model), and providing a comprehensive evaluation. In such framework, UQ metrics are evaluated for different application scenarios, e.g. eliminate the predictions with the lowest confidence or obtain a reliable uncertainty estimate for an acquisition function. Taken together, this library will help to standardize UQ investigations and evaluate new methodologies.

Keywords

Uncertainty quantification
uncertainty estimation
applicability domain
machine learning
benchmarking
model evaluations
decision-making

Supplementary materials

Title
Description
Actions
Title
Supporting Information
Description
Supplementary Tables and Figures
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.