Abstract
Feature selection is an important topic that has been widely studied in data science. Recently, graph neural networks (GNNs) and graph convolutional networks (GCNs) have also been employed in chemistry. To enhance the performance characteristics of the GNN and GCN in the field of chemistry, the feature selection should also be discussed in detail from the chemistry viewpoint. Thus, this study proposes a new feature in molecular GNNs based on the quantum chemical approaches and discusses the accuracy, overcorrelation between features, and interpretability. From the overcorrelation and accuracy, the important graph convolution (IGC) with molecular-atomic properties (MAP) proposed herein showed good performance. Moreover, the integrated gradients analysis showed that the machine learning model with the IGC(MAP) explained the prediction outputs reasonably.
Supplementary materials
Title
Supporting information
Description
Details of the machine learning process and RMSEs of the pKa values obtained by IGC and two types of atom features (IAP and MAP) with a different number of convolution layers.
Actions