Abstract
We present an efficient algorithm for substructure search in combinatorial libraries defined by synthons, i.e. substructures with connection points. Our method improves on existing approaches by introducing powerful heuristics and fast fingerprint screening to quickly eliminate branches of non matching combinations of synthons. With this we achieve typical response times of a few seconds on a standard desktop computer for searches in large combinatorial libraries like the Enamine REAL space. We published the Java source as part of the OpenChemLib under the BSD license, and we implemented tools to enable substructure search in custom combinatorial libraries.
Supplementary materials
Title
FFP Fragment IDCodes
Description
OpenChemLib IDCodes of the substructure fragments of the fragment fingerprint.
Actions
Title
Benchmark Results for Search in Enamine REAL Space
Description
Benchmark results of 3000 small molecule queries in the Enamine REAL Space
Actions
Supplementary weblinks
Title
OpenChemLib-Hyperspace GitHub Repository
Description
OpenChemLib-Hyperspace GitHub repository that contains code for performing fast substructure search in large combinatorial library spaces.
Actions
View