Abstract
Determining how similar two materials are in terms of both atomic composition and crystallographic structure remains a challenge, the solution of which would enable generalised machine learning using crystal structure data. We demonstrate a new method of describing crystal structures based on interatomic distances, termed the Grouped Representation of Interatomic Distances (GRID). This fast to compute descriptor can equally be applied to crystalline or disordered materials, and encodes additional information beyond pairwise distances, such as coordination environments. Combined with earth mover’s distance as a measure of similarity, we show that GRID is able to quantitatively compare materials involving both short- and long-range structural variation. Using this new material descriptor, we show that it can accurately predict bulk moduli using a simple nearest-neighbour model, and that the resulting similarity shows good generalisability across multiple materials properties.
Supplementary materials
Title
Supporting information
Description
Supporting information contains: further plots of data distributions (bulk modulus distribution, calculated distances); visualisation of the compositional distance matrix, comparisons between EMD and cosine similarity; and details of anomalous materials.
Actions
Supplementary weblinks
Title
gridrdf code
Description
Git repository containing python code used to perform the analyses in the manuscript.
Actions
View