Abstract
Computer-assisted design of small molecules has experienced a resurgence in academic and indus-
trial interest due to the widespread use of data-driven techniques such as deep generative models.
While the ability to generate molecules that fulfill required chemical properties is encouraging, the
use of deep learning models requires significant, if not prohibitive, amounts of data and computa-
tional power. At the same time, open-sourcing of more traditional techniques such as graph-based
genetic algorithms for molecular optimisation [Jensen, Chem. Sci., 2019, 12, 3567-3572] has shown
that simple and training-free algorithms can be efficient and robust alternatives. Further research
alleviated the common genetic algorithm issue of evolutionary stagnation by enforcing molecular
diversity during optimisation [Van den Abeele, Chem. Sci., 2020, 42, 11485-11491]. The crucial
lesson distilled from the simultaneous development of deep generative models and advanced genetic
algorithms has been the importance of chemical space exploration [Aspuru-Guzik, Chem. Sci., 2021,
12, 7079-7090]. For single-objective optimisation problems, chemical space exploration had to be
discovered as a usable resource but in multi-objective optimisation problems, an exploration of trade-
offs between conflicting objectives is inherently present. In this paper we provide state-of-the-art
and open-source implementations of two generations of graph-based non-dominated sorting genetic
algorithms (NSGA-II, NSGA-III) for molecular multi-objective optimisation. In addition, we provide
the results of a series of benchmarks for the inverse design of small molecule drugs for both the
NSGA-II and NSGA-III algorithms.
Supplementary materials
Title
Data
Description
Data of the Pareto optimisations as presented in the paper.
Actions
Supplementary weblinks
Title
Github repository
Description
This github repository contains open-source implementations of NSGA-II, NSGA-III and GB-EPI for optimization of small organic molecules.
Actions
View