Matbench Discovery -- A framework to evaluate machine learning crystal stability predictions
Janosh Riebesell, Rhys E. A. Goodall, Philipp Benner, Yuan Chiang,, Bowen Deng, Gerbrand Ceder, Mark Asta, Alpha A. Lee, Anubhav Jain, Kristin A., Persson

TL;DR
Matbench Discovery is a benchmarking framework that evaluates machine learning models for predicting crystal stability, highlighting the effectiveness of universal interatomic potentials and emphasizing task-specific metrics for materials discovery.
Contribution
We introduce Matbench Discovery, a standardized evaluation framework and Python package for benchmarking ML models in crystal stability prediction, including a comprehensive comparison of various approaches.
Findings
UIP models achieved F1 scores of 0.57-0.82.
UIP models provided up to 6x acceleration in discovery.
Task-specific metrics revealed limitations of regression-based approaches.
Abstract
The rapid adoption of machine learning (ML) in domain sciences necessitates best practices and standardized benchmarking for performance evaluation. We present Matbench Discovery, an evaluation framework for ML energy models, applied as pre-filters for high-throughput searches of stable inorganic crystals. This framework addresses the disconnect between thermodynamic stability and formation energy, as well as retrospective vs. prospective benchmarking in materials discovery. We release a Python package to support model submissions and maintain an online leaderboard, offering insights into performance trade-offs. To identify the best-performing ML methodologies for materials discovery, we benchmarked various approaches, including random forests, graph neural networks (GNNs), one-shot predictors, iterative Bayesian optimizers, and universal interatomic potentials (UIP). Our initial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · X-ray Diffraction in Crystallography · Computational Drug Discovery Methods
MethodsFocus
