Graph Transformer Foundation Model for modeling ADMET properties

20 September 2024, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Pre-trained on vast datasets, foundation models demonstrate strong generalization across different tasks, particularly in natural language processing and computer vision. This study explores the potential of applying foundation models to the domain of ADMET (Absorption, Distribution, Metabolism, Excretion, and Toxicity) property prediction, a critical task in early-stage drug discovery. We present a Graph Transformer Foundation Model (GTFM) that combines the strengths of graph neural networks (GNNs) and transformer architectures to model molecular graphs. We use self-supervised learning (SSL) to extract useful representations from large unlabeled datasets by predicting masked nodes/edges and the Joint Embedding Predictive Architecture (JEPA). The latter demonstrates superior performance in most tasks by learning robust and predictive features from molecular graphs. The GTFM is benchmarked against classical machine learning models using predefined molecular descriptors. The results demonstrate that the GTFM, especially when employing JEPA, outperforms classical approaches for ADMET property prediction in 8 out of 19 classification and 5 out of 9 regression tasks, being comparable in the rest. This shows foundation models, specifically GTFM, as a promising approach for ADMET modeling, providing a scalable and versatile solution for drug discovery applications.

Keywords

Drug discovery
Foundation models
Transformers
ADMET
Graph Neural Network

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.