Abstract
Quinones and hydroquinones are small organic molecules with numerous applications: battery electrolytes, pharmaceuticals, sensors, to name a few. An understanding of their fundamental properties, such as melting points, is essential to incorporate these compounds into relevant technologies. In this study, two different approaches were investigated to predict the melting points of quinone and hydroquinone-based molecules. In the first approach, molecular features were calculated with the Mordred molecular descriptor calculator and used to train a ridge regression and a random forest machine learning (ML) model. In the second, a simpler featurization that captures key enthalpic and entropic descriptors was applied in three model variants - a thermodynamic model, a ridge regression model, and a random forest model. The Mordred-calculated features in the ridge regression model outperformed the thermodynamic features across all models for the quinone dataset (average absolute errors (AAE): 29.5 °C), but the thermodynamic features in the thermodynamic model resulted in lower prediction errors for the hydroquinone dataset (AAE: 35.4 °C). These results emphasize the importance of including intermolecular interaction descriptors, especially for classes of molecules in which these interactions are expected to be strong (e.g. hydrogen bonding in hydroquinones). As a byproduct of this study, we also consolidate (from previously published data) four new datasets with quinone and hydroquinone melting points and other key features.
Supplementary materials
Title
Supplemental Information
Description
• Experimental Details of data curation and model implementation (pg 1-4)
• Thermodynamic Features in Thermodynamic Model for Hydrocarbon Dataset (pg 5 - 6)
• Thermodynamic Features in Thermodynamic Model for combined Quinone + Hydro- quinone Dataset (pg 6 - 7)
26
• Molecular Flexibility Components treated as individual features for Quinone and Hy- droquinone Datasets (pg 8 - 11)
Actions