Revealing Structure-Property Relationships in Polybenzenoid Hydrocarbons with Interpretable Machine-Learning

16 May 2022, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

The structure-property relationships of polybenzenoid hydrocarbons (PBHs) were investigated with interpretable machine learning, for which two new tools were developed and applied. First, a novel textual molecular representation, based on the annulation sequence of PBHs was defined and developed. This representation can be used either in its textual form or as a basis for a curated feature-vector; both forms show improved interpretability over the standard SMILES representation, and the former also has increased predictive accuracy. Second, the recently-developed model, CUSTODI, was applied for the first time as an interpretable model and identified important structural features that impact various electronic molecular properties. The resulting insights not only validate several well-known “rules of thumb” of organic chemistry but also reveal new behaviors and influential structural motifs, thus providing guiding principles for rational design and fine-tuning of PBHs.

Keywords

polycyclic aromatic hydrocarbons
polybenzenoid hydrocarbons
interpretable machine learning
CUSTODI
COMPAS Project
structure-property relationships
molecular design

Supplementary materials

Title
Description
Actions
Title
Supporting Information for LALAS paper
Description
Details of computational methods and model construction. Full fit results. Additional comparison to other models.
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.