Evaluating the Diversity and Target Addressability of DNA-encoded Libraries using BM-Scaffold Analysis and Machine Learning

22 October 2024, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

DNA-encoded libraries (DELs) enable efficient experimental screening of vast combinatorial molecular libraries, making it a powerful platform for drug discovery. Apart from ensuring the druglike physicochemical properties, other key parameters to maximize the success rate of DEL designs include the scaffold diversity and target addressability. While several computational tools have been developed to evaluate DEL chemical diversity, a dedicated tool that combines both parameters is currently lacking. In this work, we developed a computational approach to systematically evaluate both the scaffold diversity and target-orientedness of DELs using Bemis-Murcko (BM) scaffold analysis and machine learning. To demonstrate the utility of this approach, we present a case study using two of our in-house produced libraries. We show that our workflow can effectively distinguish between a generalist and a focused library. Furthermore, we show that although focused libraries tend to have higher compound-based addressability, they could suffer from lower scaffold-based addressability relative to a generalist library. Consequently, we illustrate how our computational tool can guide medicinal chemists in deciding which library to screen as a function of the objective, whether it is hit-finding or hit-optimization. To facilitate utilization, this tool is freely available both as a web application and as a Python script at https://github.com/novalixofficial/NovaWebApp.

Keywords

DELs
DNA-encoded Library
target addressability
chemical space
chemical diversity

Supplementary materials

Title
Description
Actions
Title
Supporting Information
Description
Statistical analysis of cross validation scores
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.