TADF-GEN: An Iterative and Self-improving Method with Long-term Memory and Dynamic Similarity Weight for Generating Thermally Activated Delayed Fluorescence (TADF) Molecules

10 October 2024, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Thermally Activated Delayed Fluorescence (TADF) molecules are expected to be used in emitting layer materials for the next generation of organic light-emitting diodes (OLEDs) in various display applications, but their high-throughput discovery/generation are still challenging due to their vast chemical space, high cost of quantum chemical calculations and the tricky exploration-exploitation trade-off. Besides, TADF populations are far away from existing open-source database, which hinders the use of these database and makes the direct use of deep generative models a challenge. To address these issues, in this work we present an iterative and self-improving workflow, TADF-GEN. We combine atom-wise and fragment-based morphing operations designed with domain knowledge, machine learning based property prediction methods, multiscale quantum chemical calculations, TADF-specific metrics, with our special implementation of long-term memory (LTM) and dynamic similarity weight (DSW). With TADF-GEN, we explored the chemical space including various types and sizes of TADF molecules for the first time and established a new dataset of over 1.3 M molecules among which over 39 K molecules with TD-DFT labelled data. Combined LTM with DSM, we find the improvement of molecular diversity inherently help generate molecules with better performance. Besides, our model can effectively navigate the chemical space across different TADF domains (from multiple resonance type to donor-acceptor type). With accurate double hybrid TD-DFT excited state calculations, our generated molecules are proved excellent in both TADF properties and diversity compared with their seed molecules. Our TADF-GEN model and generated database can be directly used for future TADF works. Our proposed workflow is expected to be also useful for molecular discovery/generation in other data-shortage domains such as batteries and semiconductor materials.

Supplementary materials

Title
Description
Actions
Title
TADF-GEN-SI
Description
More details in TADF-GEN workflow including discussions on benchmark experiments, effects of seed molecules, and supplementary figures.
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.