ArchOnML; an open-source Python package for machine-learning assisted molecular design

13 March 2025, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

The Python package ArchOnML ("Archive-On-Machine-Learning") is introduced. It uses Kernel Ridge Regression models to carry out virtual screening projects covering up to millions of structural derivatives. It supports the full workflow, from setting up calculation inputs for external quantum chemistry program packages to post-processing their outputs for training and prediction. An example screening project for over 1.3 million anthraquinone derivatives is presented, in which excitation energies and oscillator strengths of the first two excited singlet and triplet states are predicted from descriptors based on semi-empirical quantum chemistry results. Compared to a non-ML calculation protocol, ArchOnML achieves a speed-up factor of over 400, with mean absolute errors for excitation energies of 0.1 eV. Owing to ArchOnML's modular application programming interface, new descriptors, models and interfaces to other external quantum chemistry programs can be added in a straightforward way.
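
For readers unfamiliar with Kernel Ridge Regression, the following is a minimal, generic sketch of the technique in plain Python. It is not ArchOnML's code or API: the function names (gaussian_kernel, solve_linear_system, krr_fit, krr_predict), the choice of a Gaussian kernel, the hyperparameter values, and the one-dimensional toy data are all assumptions made for illustration. In a real screening project, the descriptor vectors would come from semi-empirical quantum chemistry results and the targets would be properties such as excitation energies.

```python
import math


def gaussian_kernel(x, y, sigma):
    """Gaussian (RBF) kernel between two descriptor vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))


def solve_linear_system(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Work on an augmented copy so the inputs are left untouched.
    M = [list(row) + [rhs] for row, rhs in zip(A, b)]
    for col in range(n):
        # Swap in the row with the largest pivot for numerical stability.
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for row in range(col + 1, n):
            factor = M[row][col] / M[col][col]
            for k in range(col, n + 1):
                M[row][k] -= factor * M[col][k]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for row in range(n - 1, -1, -1):
        tail = sum(M[row][k] * x[k] for k in range(row + 1, n))
        x[row] = (M[row][n] - tail) / M[row][row]
    return x


def krr_fit(X, y, sigma, lam):
    """Return regression weights alpha solving (K + lam * I) alpha = y."""
    n = len(X)
    K = [[gaussian_kernel(X[i], X[j], sigma) for j in range(n)]
         for i in range(n)]
    for i in range(n):
        K[i][i] += lam  # ridge regularisation on the diagonal
    return solve_linear_system(K, y)


def krr_predict(X_train, alpha, x_new, sigma):
    """Predict the target for x_new as a kernel-weighted sum over training points."""
    return sum(a * gaussian_kernel(x_i, x_new, sigma)
               for a, x_i in zip(alpha, X_train))


# Toy data (assumption): one-dimensional "descriptors" with a smooth target.
X_train = [[0.0], [0.5], [1.0], [1.5], [2.0]]
y_train = [math.sin(x[0]) for x in X_train]

alpha = krr_fit(X_train, y_train, sigma=0.5, lam=1e-6)
prediction = krr_predict(X_train, alpha, [0.75], sigma=0.5)
print(prediction)
```

Training requires solving one linear system of the size of the training set, while each prediction is only a weighted sum of kernel evaluations. This is the general reason such models are attractive for screening large candidate libraries once descriptors are cheap to compute.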

Keywords

machine learning
virtual screening
cheminformatics

Supplementary materials

Title: Supplementary Data
Description: This file contains additional data, not contained in the main article. The main article references the sections of this file.
