Retrieval Augmented Docking using Hierarchical Navigable Small Worlds

Brendan Hall; Michael Keiser

doi:10.26434/chemrxiv-2024-qsdd1

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

Retrieval Augmented Docking using Hierarchical Navigable Small Worlds

24 April 2024, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Make-on-demand chemical libraries have drastically increased the reach of molecular docking, with the enumerated ready-to-dock ZINC library approaching 5 billion molecules. While ever-growing libraries result in better-scoring molecules, the computational resources required to dock all of ZINC make this endeavor infeasible for most. Here, we organize and traverse chemical space with hierarchical navigable small world graphs, a method we term retrieval augmented docking (RAD). RAD recovers most virtual actives despite docking only a fraction of the library. Furthermore, RAD is protein-agnostic, supporting screens against many targets without additional computational overhead. In depth, we assess RAD on published large-scale docking campaigns against D4 and AmpC spanning 99.5 million and 138 million molecules, respectively. RAD recovers 95% of DOCK virtual actives for both targets after evaluating only 10% of the libraries. In breadth, RAD shows widespread applicability against 43 DUDE-Z proteins, evaluating 50.3 million associations. On average, RAD recovers 87% of virtual actives while docking 10% of the library without sacrificing chemical diversity.

Keywords

RAD

retrieval augmented docking

HNSW

hierarchical navigable small world

Supplementary materials

Title

Description

Actions

Title

Supplementary Information

Description

Supplementary Figures 1-10, Supplementary Tables 1-2

Actions

Title

Supplementary Data 1

Description

Supplementary Tables 3-5

Actions

Supplementary weblinks

Title

Description

Actions

Title

RAD GitHub Repository

Description

Open-source code repository for the Retrieval Augmented Docking (RAD) package developed in this study. It contains the complete source code and the modified hnswlib library and code for constructing and traversing HNSWs with user-implemented scoring functions.

Actions

View

Title

RAD Paper Zenodo Data Repository

Description

DOCK scores for the DUDE-Z "goldilocks" molecules docked to each of the 43 DUDE-Z proteins in the paper.

Actions

View

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Now Published

Retrieval Augmented Docking Using Hierarchical Navigable Small Worlds

Brendan W. Hall, Michael J. Keiser journal article

Journal of Chemical Information and Modeling , Volume 64, Issue 19

Online publication date: Oct 03, 2024

Version History

Apr 24, 2024 Version 1

Metrics

946

461

Views

Downloads

Citations

License

The content is available under CC BY 4.0

DOI

10.26434/chemrxiv-2024-qsdd1

Funding

Chan Zuckerberg Initiative

DAF2018-191905 (DOI 10.37921/550142lkcjzw)

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Retrieval Augmented Docking using Hierarchical Navigable Small Worlds

Authors

Abstract

Keywords

Supplementary materials

Supplementary weblinks

Comments

Now Published

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share