Lies and Liabilities: Computational Assessment of High-Throughput Screening Hits to Identify Artifact Compounds.

22 February 2023, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Hits from high-throughput screening (HTS) of chemical libraries are often false positives due to their interference with the assay detection technology. In response to this problem, we generated the largest publicly available library of chemical liabilities and developed a free webtool to predict HTS artifacts, both of which are described herein. More specifically, we generated, curated, and integrated HTS datasets for thiol reactivity, redox activity, and luciferase (firefly and nano) activity. Using these curated datasets, we developed and validated several Quantitative Structure-Interference Relationship (QSIR) models to predict nuisance behavior. The resulting models showed balanced accuracy (BA) for external datasets ranging from 62 to 78%. These models were employed for virtual profiling of the National Center for Advancing Translational Science’s (NCATS) in-house library, and 256 compounds for each assay were selected for the experimental validation of nuisance behavior. The BA for these external predictions ranged from 58 to 78% for compounds within the applicability domains of the models. Our findings suggest that the QSIR models developed and validated herein identify nuisance compounds among experimental hits more reliably than popular PAINS filters. The models developed in this study, along with aggregation models previously developed by our group (SCAM Detective) were implemented in “Liability Predictor,” an online tool which may be used as part of chemical library design or for triaging HTS hits. Both the models and the curated datasets are publicly available at https://liability.mml.unc.edu/.

Keywords

Nuisance compounds
Assay Liability
PAINS
Aggregation
Luciferase
Assay artifacts
HTS artifacts
Hit Triage
QSPR
QSAR

Supplementary materials

Title
Description
Actions
Title
Supplementary Materials
Description
Supporting information includes curated SDF datasets and results of the experimental validation.
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.