Screening Unknown Novel Psychoactive Substances Using GC-MS Based Machine Learning

02 November 2022, Version 2
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

In recent years, there is a large increase in structural diversity of novel psychoactive substances (NPS), exacerbating drug abuse issues as these variants evade classical detection methods such as spectral library matching. Gas chromatography mass spectrometry (GC-MS) is commonly used to identify these NPS. To tackle this issue, machine learning models are developed to address the analytical challenge of identifying unknown NPS, using only GC-MS data. 891 GC-MS spectra are used to train and evaluate multiple supervised machine learning classifiers, namely artificial neural network (ANN), convolutional neural network (CNN) and balanced random forest (BRF). 7 classes, comprising 6 NPS classes (cathinone, cannabinoids, phenethylamine, piperazine, tryptamines and fentanyl) and other unrelated compounds can be effectively classified with a macro-F1 score ~ 0.9, averaged across all cross-validation folds. These results indicate that machine learning models are a promising complement as an effective NPS detection tool.

Keywords

Chemometrics
Neural network
Machine learning
Random Forest
Mass Spectrometry
Artificial intelligence

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.