Fragment libraries from large synthetic compounds and natural products: A comparative chemoinformatic analysis

06 February 2025, Version 1

Abstract

We report comprehensive fragment libraries derived from large natural product databases and compare their chemical space coverage and diversity with those of synthetic fragment libraries. Specifically, we obtained 2,583,127 fragments from the recently updated Collection of Open Natural Products (COCONUT), a data set of more than 695,133 non-redundant natural products, and 74,193 fragments from the Latin America Natural Product Database (LANaPDB), which contains 13,578 unique natural products from Latin America. The content, chemical space coverage, and chemical diversity of the natural product libraries were compared with those of the recently developed CRAFT library, which contains 1,214 fragments based on novel heterocyclic scaffolds and natural product-derived chemicals. The fragment libraries obtained and curated herein are freely available at https://github.com/DIFACQUIM/Fragment-libraries-from-large-synthetic-compounds-and-natural-products-collections.git.
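
As a rough illustration of the scale of these libraries, the counts quoted in the abstract can be turned into an average number of fragments per parent compound. This is only a back-of-the-envelope sketch: the ratio is not a statistic reported by the authors, the names `LIBRARIES` and `fragments_per_compound` are hypothetical, and because the COCONUT compound count is given as a lower bound ("more than"), its ratio should be read as an upper estimate.

```python
# Counts taken from the abstract: (fragments, parent natural products).
# The dictionary and function names are illustrative assumptions.
LIBRARIES = {
    "COCONUT": (2_583_127, 695_133),  # compound count is a stated lower bound
    "LANaPDB": (74_193, 13_578),
}


def fragments_per_compound(n_fragments: int, n_compounds: int) -> float:
    """Average number of fragments obtained per parent compound."""
    if n_compounds <= 0:
        raise ValueError("n_compounds must be positive")
    return n_fragments / n_compounds


ratios = {
    name: fragments_per_compound(n_frag, n_cmpd)
    for name, (n_frag, n_cmpd) in LIBRARIES.items()
}

for name, ratio in ratios.items():
    print(f"{name}: about {ratio:.2f} fragments per compound")
```

On these numbers, the LANaPDB library holds more fragments per parent compound than the COCONUT library, although the abstract alone does not say why.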

Supplementary materials

Supplementary figures
Eight supplementary figures.

Supplementary weblinks

Fragment-library
Fragment Library of the Center for Research and Advancement of Fragments and Molecular Targets
Fragment libraries: natural products and synthetic compounds
Fragment libraries from large synthetic compounds and natural products collections