Applicability Domain of Polyparameter Linear Free Energy Relationship Models Evaluated by Leverage and Prediction Interval Calculation

Satoshi Endo

doi:10.26434/chemrxiv-2022-qs03q-v2

Earth, Space, and Environmental Chemistry

Search within Earth, Space, and Environmental Chemistry

Applicability Domain of Polyparameter Linear Free Energy Relationship Models Evaluated by Leverage and Prediction Interval Calculation

03 February 2022, Version 2

Working Paper

Satoshi Endo

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Polyparameter linear free energy relationships (PP-LFERs) are accurate and robust models employed to predict equilibrium partition coefficients (K) of organic chemicals. The accuracy of predictions by a PP-LFER depends on the composition of the respective calibration data set. Generally, extrapolation outside the model calibration domain is likely to be less accurate than interpolation. In this study, the applicability domain (AD) of PP-LFERs was systematically evaluated by calculating the leverage (h) and prediction interval (PI). Repeated simulations with experimental data showed that the root mean squared error of predictions increased with h. However, the analysis also showed that PP-LFERs calibrated with a large number (e.g., 100) of training data were highly robust against extrapolation error. For such well-calibrated PP-LFERs, the common definition of extrapolation (h > 3 hmean, where hmean is the mean h of all training compounds) may be excessively strict. Alternatively, the PI is proposed as a metric to define the AD of PP-LFERs, as it provides a concrete estimate of the error range that agrees well with the observed errors, even for extreme extrapolations. Additionally, published PP-LFERs were evaluated in terms of their AD using the new concept of AD probes, which indicated the varying predictive performance of PP-LFERs in existing literature for environmentally relevant compounds.

Keywords

partiton coefficient

linear solvation energy relationship

Supplementary materials

Title

Description

Actions

Title

Electronic supplementary information 1

Description

Additional tables and figures

Actions

Title

Electronic supplementary information 2

Description

Excel file with a macro that calculates the leverage and the prediction intervals

Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Now Published

Applicability Domain of Polyparameter Linear Free Energy Relationship Models Evaluated by Leverage and Prediction Interval Calculation

Satoshi Endo journal article

Environmental Science & Technology , Volume 56, Issue 9

Online publication date: Apr 14, 2022

Version History

Feb 03, 2022 Version 2

Jan 06, 2022 Version 1

Version Notes

Feb 03, 2022 Version 2 (after extensive revision)

Metrics

952

332

Views

Downloads

Citations

License

The content is available under CC BY NC 4.0

DOI

10.26434/chemrxiv-2022-qs03q-v2

Funding

JSPS

KAKENHI Grant Number JP18K05204

JSPS

KAKENHI Grant Number JP16K16216

MEXT/JST

Tenure Track Promotion Program

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Applicability Domain of Polyparameter Linear Free Energy Relationship Models Evaluated by Leverage and Prediction Interval Calculation

Authors

Abstract

Keywords

Supplementary materials

Comments

Now Published

Version History

Version Notes

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share