Building Block-Based Binding Predictions for DNA-Encoded Libraries

Chris Zhang; Mary Pitman; Anjali Dixit; Sumudu Leelananda; Henri Palacci; Meghan Lawler; Svetlana Belyanskaya; LaShadric Grady; Joe Franklin; Nicolas Tilmans; David Mobley

doi:10.26434/chemrxiv-2023-pq197

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

Building Block-Based Binding Predictions for DNA-Encoded Libraries

17 April 2023, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

DNA-encoded libraries (DELs) provide the means to make and screen millions of diverse compounds against a target of interest in a single experiment. However, despite producing large volumes of binding data at a relatively low cost, the DEL selection process is susceptible to noise, necessitating computational follow-up to increase signal-to-noise ratios. In this work, we present a set of informatics tools to analyze DEL selection data so that subsequent DEL screens probe productive regions of chemical space. Our approach segments DEL data at the individual building block level to identify productive building blocks in a library. We show how similar building blocks have a similar probability of binding, which we then employ to predict the behavior of untested building blocks. Lastly, we build a model from the inference that the combined behavior of individual building blocks is predictive of the activity of an overall compound. We report a performance of more than an order of magnitude greater than random guessing on a holdout set, demonstrating that our model can serve as a baseline for comparison against other machine learning models on DEL data.

Keywords

DNA encoded libraries

combinatorial chemistry

dimensionality reduction

chemical similarity

Supplementary materials

Title

Description

Actions

Title

Supporting Information: Building Block-Based Binding Predictions for DNA-Encoded Libraries

Description

The Supporting Information includes additional methods on how we constructed the HDBSCAN loss function and how we generated the DEL selection data. We also include additional data on hyperparameter optimization, evaluation of the method using 2D Tanimoto similarity and data tables for the figures presented in the main text.

Actions

Title

Compiled dataset of sEH binders and non-binders

Description

Data used to perform analysis in the paper.

Actions

Supplementary weblinks

Title

Description

Actions

Title

DEL analysis

Description

GitHub repository containing all associated code and scripts.

Actions

View

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Apr 17, 2023 Version 1

Metrics

2,525

1,994

Views

Downloads

Citations

License

The content is available under CC BY 4.0

DOI

10.26434/chemrxiv-2023-pq197

Funding

National Institutes of Health

R01GM108889

National Institutes of Health

R35GM148236

National Science Foundation

CNS-1828779

Author’s competing interest statement

DLM serves on the scientific advisory boards of OpenEye Scientific Software and Anagenex, and is an Open Science Fellow with Roivant Therapeutics. Aside from that, the authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Building Block-Based Binding Predictions for DNA-Encoded Libraries

Authors

Abstract

Keywords

Supplementary materials

Supplementary weblinks

Comments

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share