Abstract
Recent advances in machine learning have led to the rapid adoption of various computational methods for de novo molecular design in polymer research, including high-throughput virtual screening and inverse molecular design. In such workflows, molecular generators play an essential role in the creation or sequential modification of candidate polymer structures. Machine learning-assisted molecular design has made great technical progress over the past few years. However, its practical deployment has not progressed as much as expected. One reason for this is the difficulty in determining the synthetic routes to such designed polymers. To address this technical limitation, we present SMiPoly, a Python library for virtual polymer generation that implements 22 chemical rules for commonly applied polymerization reactions. For given small organic molecules to form a candidate monomer set, the SMiPoly generator conducts possible polymerization reactions to generate an exhaustive list of potentially synthesizable polymers. In this study, using 1,083 readily available monomers, we generated 169,347 unique polymers forming seven different molecular types: polyolefin, polyester, polyether, polyamide, polyimide, polyurethane, and polyoxazolidone. By comparing the distribution of the virtually created polymers with approximately 16,000 real polymers synthesized so far, it was found that the coverage and novelty of the SMiPoly-generated polymers can reach 48% and 53%, respectively.
Supplementary materials
Title
supporting information for the monomer definition and the polymerization reaction rules
Description
Supplementary note for the definition of monomers and polymerization reactions rules. The examples of generated polymers are also described.
Actions
Supplementary weblinks
Title
SMiPoly
Description
The source code of SMiPoly is available from the GitHub website.
Actions
View