Interpretable deep-learning pKa prediction for small molecule drugs via atomic sensitivity analysis

Joseph DeCorte; Benjamin Brown; Jens Meiler

doi:10.26434/chemrxiv-2024-hr692

Biological and Medicinal Chemistry

Search within Biological and Medicinal Chemistry

Interpretable deep-learning pKa prediction for small molecule drugs via atomic sensitivity analysis

12 June 2024, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Machine learning (ML) models play a crucial role in predicting properties essential to drug development, such as a drug’s logscale acid-dissociation constant (pKa). Despite recent architectural advances, these models often generalize poorly to novel compounds due to a scarcity of ground-truth data. Further, these models lack interpretability, in part due to a dependence on explicit encodings of input molecules’ molecular substructures. To this end, atomic-resolution information is accessible in chemical structures by observing model response to atomic perturbations of an input molecule; however, no methods exist that systematically utilize this information for model and molecular analysis. Here, we present BCL-XpKa, a substructure-independent, deep neural network (DNN)-based pKa predictor that generalizes well to novel small molecules. BCL-XpKa discretizes pKa prediction from a regression problem into a multitask-classification problem, which accumulates data for prediction at biologically relevant pH values and records the model’s uncertainty in its prediction as a discrete distribution for each pKa prediction. BCL-XpKa outperforms modern ML pKa predictors and accurately models the effects of common molecular modifications on a molecule’s ionizability. We then leverage BCL-XpKa’s substructure independence to introduce atomic sensitivity analysis (ASA), which quickly decomposes a molecule’s predicted pKa value into its respective atomic contributions without model retraining. When paired with BCL-XpKa, ASA informs that BCL-XpKa has implicitly learned high-resolution information about molecular substructures. We further demonstrate ASA’s utility in structure preparation for protein-ligand docking by identifying ionization sites in 97.8% and 83.4% of complex small molecule acids and bases. We then apply ASA with BCL-XpKa to understand the physicochemical liabilities and guide optimization of a recently published KRAS-degrading PROTAC.

Keywords

pKa prediction

model explainability

model interpretability

QSPR

QSAR

neural networks

Supplementary materials

Title

Description

Actions

Title

Supplementary Tables

Description

Hyperparameter Optimization for BCL-XpKa

Actions

Supplementary weblinks

Title

Description

Actions

Title

The Biology and Chemistry Library

Description

Open-source cheminformatics platform developed by the Meiler lab and used throughout this work.

Actions

View

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Jun 12, 2024 Version 1

Metrics

779

396

Views

Downloads

Citations

License

The content is available under CC BY NC 4.0

DOI

10.26434/chemrxiv-2024-hr692

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) declare that they have sought and gained approval from the relevant ethics committee/IRB for this research and its publication.

Interpretable deep-learning pKa prediction for small molecule drugs via atomic sensitivity analysis

Authors

Abstract

Keywords

Supplementary materials

Supplementary weblinks

Comments

Version History

Metrics

License

DOI

Author’s competing interest statement

Ethics

Share