A Data-Driven Perspective on the Colours of Metal-Organic Frameworks

01 October 2020, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Colour is at the core of chemistry and has been fascinating humans since ancient times. It is also a key descriptor of optoelectronic properties of materials and is used to assess the success of a synthesis. However, predicting the colour of a material based on its structure is challenging. In this work, we leverage subjective and categorical human assignments of colours to build a model that can predict the colour of compounds on a continuous scale, using chemically meaningful reasoning. In the process of developing the model, we also uncover inadequacies in current reporting mechanisms. For example, we show that the majority of colour assignments are subject to perceptive spread that would not comply with common printing standards. To remedy this, we suggest and implement an alternative way of reporting colour—and chemical data in general—that is more suitable for a data-driven approach to chemistry. All data is captured in an electronic lab notebook and subsequently exported to a repository.

Keywords

MOF
ELN
Machine Learning
Repository
Data in chemistry
Reproducibility

Supplementary materials

Title
Description
Actions
Title
20200923 colorpaper si
Description
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.