Abstract
Computational exploration of chemical space is crucial in modern cheminformatics research for accelerating the discovery of new biologically active compounds. In this study, we present a detailed analysis of the chemical library of potential glucocorticoid receptor (GR) ligands generated by the molecular generator, Molpher. To generate the targeted GR library and to construct the classification models, structures from the ChEMBL database as well as from the internal IMG library, that was experimentally screened for biological activity in the primary luciferase reporter cell assay, were utilized. The composition of the targeted GR ligand library was compared with a reference library that randomly samples chemical space. A random forest model was used to determine the biological activity of ligands, incorporating its applicability domain using conformal prediction. It was demonstrated that the GR library is significantly enriched with GR ligands compared to the random library. Furthermore, prospective analysis demonstrated that Molpher successfully designed compounds which were subsequently experimentally confirmed to be active on the GR. A collection of 34 potential new GR ligands was also identified. Moreover, an important contribution of this study is the establishment of a comprehensive workflow for the evaluation of computationally generated ligands, particularly those with potential activity against targets that are challenging to dock.
Supplementary materials
Title
Designed GR ligands
Description
The list and structures of 54 GR ligands with their QED, NIBR severity score, MolSkill score, predicted activity (pEC50 value), the result of the manual annotation and remarks, if available.
Actions
Title
HTS experiment description
Description
The description of the experimental testing of the IMG library using the primary luciferase reporter cell assay.
Actions