Structure-based out-of-distribution (OOD) materials property prediction: a benchmark study
Sadman Sadeed Omee, Nihang Fu, Rongzhi Dong, Ming Hu and, Jianjun Hu

TL;DR
This study benchmarks structure-based graph neural networks for out-of-distribution materials property prediction, revealing significant performance gaps and providing insights to enhance model robustness in realistic material discovery scenarios.
Contribution
It introduces a comprehensive OOD benchmark for GNNs in materials science and analyzes factors affecting their generalization performance.
Findings
State-of-the-art GNNs underperform on OOD tasks
Identified sources of robustness in certain GNN models
Provided insights for improving OOD generalization
Abstract
In real-world material research, machine learning (ML) models are usually expected to predict and discover novel exceptional materials that deviate from the known materials. It is thus a pressing question to provide an objective evaluation of ML model performances in property prediction of out-of-distribution (OOD) materials that are different from the training set distribution. Traditional performance evaluation of materials property prediction models through random splitting of the dataset frequently results in artificially high performance assessments due to the inherent redundancy of typical material datasets. Here we present a comprehensive benchmark study of structure-based graph neural networks (GNNs) for extrapolative OOD materials property prediction. We formulate five different categories of OOD ML problems for three benchmark datasets from the MatBench study. Our extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · X-ray Diffraction in Crystallography · Advanced Neural Network Applications
MethodsSparse Evolutionary Training
