Abstract
Human gut microbial metabolites are currently undergoing much research due to their involvement in multiple biological processes important for health, including immunity, metabolism, nutrition, and the nervous system. Metabolites exert their effect through the interaction with host and bacterial proteins, suggesting the use of “metabolite-mimetic” molecules as drugs and nutraceutics. In the present work, we retrieve and analyze the full set of published interactions of these compounds with human and microbiome-relevant proteins, and find patterns in their structure, chemical class, target class, and biological origins. In addition, we use virtual screening to expand (> 4-fold) the interactions, validate them with retrospective analyses, and use bioinformatic tools to prioritize them based on biological relevance. In this way, we fill many of the chemobiological gaps observed in the published data. By providing these interactions we expect to speed up the full clarification of the chemobiological space of these compounds, by suggesting many reliable predictions for fast, focused experimental testing.
Supplementary materials
Title
Supporting Information
Description
Table S1: set of microbial genera typical in human microbial metagenomics analyses and used in this work.
Table S2: distribution by target classes of target sharing between gut microbial metabolites and drugs .
Table S3: set of published and predicted metabolite-target interactions. For each interaction, the following data is provided: hmdb identifier (“hmdb_id”), inchi string, chemical class (“chem_cl”), compound set (“cset”: Metabolites vs Drugs); specific compound set (“comp_set”: Drugs vs GutFL vs GutnoFL vs Gut/Serum); uniprot accession number of the target (“uniport_id”); target name (“tar_name”); target class (“tar_cl”); target biological group (“tar_biolgr”: “b” for bacterial, “h” for human); biological species (“organism”); source of data (“src”); pchembl-like affinity data (“pbind”); maximum Tanimoto coefficient for SEA prediction (“maxTc”); name of compound (“comp_name”); aggregated source of data (“src2”); even more aggregated source of data (“src3”); high priority target (“hpr”: empty vs “hum” for high-priority human vs “bac” for high-priority bacterial).
Actions