Uncertain of uncertainties? A comparison of uncertainty quantification metrics for chemical data sets

Maria H. Rasmussen; Chenru Duan; Heather J. Kulik; Jan Halborg Jensen

doi:10.26434/chemrxiv-2023-w93dm

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

Uncertain of uncertainties? A comparison of uncertainty quantification metrics for chemical data sets

08 September 2023, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

With the increasingly more important role of machine learning (ML) models in chemical research, the need for putting a level of confidence to the model predictions naturally arises. Several methods for obtaining uncertainty estimates have been proposed in recent years but consensus on the evaluation of these have yet to be established and different studies on uncertainties generally uses different metrics to evaluate them. We compare three of the most popular validation metrics (Spearman’s rank correlation coefficient, the negative log likelihood (NLL) and the miscalibration area) to the error-based calibration introduced by Levi et al. (Sensors 2022, 22, 5540). Importantly, metrics such as the negative log likelihood (NLL) and Spearman’s rank correlation coefficient bear little information in themselves. We therefore introduce reference values obtained through errors simulated directly from the uncertainty distribution. The different metrics target different properties and we show how to interpret them, but we generally find the best overall validation to be done based on the error-based calibration plot introduced by Levi et al. Finally, we illustrate the sensitivity of ranking-based methods (e.g. Spearman’s rank correlation coefficient) towards test set design by using the same toy model on two different test sets and obtaining vastly different metrics (0.05 vs. 0.65).

Keywords

uncertainty quantification

Supplementary weblinks

Title

Description

Actions

Title

Data

Description

Data

Actions

View

Title

GitHub repo

Description

Code

Actions

View

Title

Colab nodebook

Description

code

Actions

View

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Sep 08, 2023 Version 1

Metrics

2,670

1,354

Views

Downloads

License

The content is available under CC BY 4.0

DOI

10.26434/chemrxiv-2023-w93dm

Funding

Novo Nordisk Fonden

NNF20OC0064104

U.S. Department of Energy

Scientific Discovery through Advanced Computing (SciDAC)

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Uncertain of uncertainties? A comparison of uncertainty quantification metrics for chemical data sets

Authors

Abstract

Keywords

Supplementary weblinks

Comments

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share