Graph Neural Networks Bootstrapped for Synthetic Selection and Validation of Small Molecule Immunomodulators

Prageeth R. Wijewardhane; Krupal P. Jethava; Jonathan A Fine; Gaurav Chopra

doi:10.26434/chemrxiv-2021-r4xnx-v2

Biological and Medicinal Chemistry

Search within Biological and Medicinal Chemistry

Graph Neural Networks Bootstrapped for Synthetic Selection and Validation of Small Molecule Immunomodulators

18 October 2021, Version 2

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

The Programmed Cell Death Protein 1/Programmed Death-Ligand 1 (PD-1/PD-L1) interaction is an immune checkpoint utilized by cancer cells to enhance immune suppression. There is a huge need to develop small molecule drugs that are fast acting, cost effective, and readily bioavailable compared to antibodies. Unfortunately, synthesizing and validating large libraries of small- molecules to inhibit PD-1/PD-L1 interaction in a blind manner is both time-consuming and expensive. To improve this drug discovery pipeline, we have developed a machine learning methodology trained on patent data to identify, synthesize, and validate PD-1/PD-L1 small molecule inhibitors. Our model incorporates two features: docking scores to represent the energy of binding (E) as a global feature and sub-graph features through a graph neural network (GNN) of molecular topology to represent local features. This interaction energy-based Graph Neural Network (EGNN) model outperforms traditional machine learning methods and a simple GNN with a F1 score of 0.9524 and Cohen’s kappa score of 0.8861 for the hold out test set, suggesting that the topology of the small molecule, the structural interaction in the binding pocket, and chemical diversity of the training data are all important considerations for enhancing model performance. A Bootstrapped EGNN model was used to select compounds for synthesis and experimental validation with predicted high and low potency to inhibit PD-1/PD-L1 interaction. The potent inhibitor, (4-((3-(2,3-dihydrobenzo[b][1,4]dioxin-6-yl)-2- methylbenzyl)oxy)-2,6-dimethoxybenzyl)-D-serine, is a hybrid of two known bioactive scaffolds, with an IC50 of 339.9 nM that is comparatively better than the known bioactive compound. We conclude that our bootstrapped EGNN model will be useful to identify target-specific high potency molecules designed by scaffold hopping, a well-known medicinal chemistry technique.

Keywords

Cancer immunotherapy

Small molecule immunomodulators

PD-1/PD-L1 inhibitors

protein-protein interaction inhibitors

drug design

graph neural networks

Molecular Docking Approaches

HTRF assay

chemical synthesis

Scaffold Hopping

bootstrapping analysis

Supplementary materials

Title

Description

Actions

Title

Supporting Information EGNN paper

Description

Supplementary material - Details of graph neural network model performance, results of HTRF assay, Characterization of synthetic compounds

Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Oct 18, 2021 Version 2

Apr 06, 2020 Version 1

Version Notes

New machine learning models, bootstrapping and additional experiments

Metrics

5,594

1,866

Views

Downloads

Citations

License

The content is available under CC BY 4.0

DOI

10.26434/chemrxiv-2021-r4xnx-v2

Funding

NIH NCATS ASPIRE Awards

NIH NCATS ASPIRE Awards

Department of Chemistry start-up funds

Department of Chemistry start-up funds

Integrative Data Science Initiative award

Integrative Data Science Initiative award

NCATS Clinical and Translational Sciences Award from the Indiana Clinical and Translational Sciences Institute (UL1TR002529)

NCATS Clinical and Translational Sciences Award from the Indiana Clinical and Translational Sciences Institute (UL1TR002529)

Purdue University Center for Cancer Research, NIH grant P30 CA023168

Purdue University Center for Cancer Research, NIH grant P30 CA023168

Author’s competing interest statement

The authors declare no conflict of interest. New chemical scaffolds filed for patent.

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Graph Neural Networks Bootstrapped for Synthetic Selection and Validation of Small Molecule Immunomodulators

Authors

Abstract

Keywords

Supplementary materials

Comments

Version History

Version Notes

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share