Abstract Scope |
During the past few years, there has been an explosion of new ideas regarding features / descriptors, machine learning algorithms, and neural network architectures for predicting composition-property or structure-property relationships. However, there is no standard benchmark for measuring the performance of these algorithms. Other fields of applied ML have long since realized the value of a standardized ML benchmark; in image recognition, the ImageNet benchmark is one of the most influential papers and is widely credited with helping accelerate the stratospheric advancements in image processing neural networks. In this talk, I will describe the Matbench test set which is a set of 13 supervised machine learning problems derived from 10 experimental and ab initio datasets. These problems contain datasets which range in size from 312 to 132,752 samples, span multiple thermodynamic, electronic, optical, and mechanical properties, and are used to evaluate various state-of-the-art ML algorithms. |