Heterogeneous Ensemble Enables a Universal Uncertainty Metric for Atomistic Foundation Models

Kai Liu; Zixiong Wei; Wei Gao; Poulumi Dey; Marcel H.F. Sluiter; Fei Shuang

arXiv:2507.21297·cond-mat.mtrl-sci·July 30, 2025

Heterogeneous Ensemble Enables a Universal Uncertainty Metric for Atomistic Foundation Models

Kai Liu, Zixiong Wei, Wei Gao, Poulumi Dey, Marcel H.F. Sluiter, Fei Shuang

PDF

TL;DR

This paper introduces a scalable uncertainty metric for atomistic foundation models using a heterogeneous ensemble, enabling safer deployment, improved accuracy, and cost-effective system-specific potential development.

Contribution

It presents a novel, unified uncertainty metric based on heterogeneous ensembles that correlates with prediction errors and enhances model reliability and efficiency.

Findings

01

Uncertainty metric correlates strongly with true prediction errors.

02

System-specific potentials achieve near-DFT accuracy with minimal labels.

03

Filtering label noise can improve model accuracy beyond original labels.

Abstract

Universal machine learning interatomic potentials (uMLIPs) are reshaping atomistic simulation as foundation models, delivering near \textit{ab initio} accuracy at a fraction of the cost. Yet the lack of reliable, general uncertainty quantification limits their safe, wide-scale use. Here we introduce a unified, scalable uncertainty metric \(U\) based on a heterogeneous model ensemble with reuse of pretrained uMLIPs. Across chemically and structurally diverse datasets, \(U\) shows a strong correlation with the true prediction errors and provides a robust ranking of configuration-level risk. Leveraging this metric, we propose an uncertainty-aware model distillation framework to produce system-specific potentials: for W, an accuracy comparable to full-DFT training is achieved using only \(4\%\) of the DFT labels; for MoNbTaW, no additional DFT calculations are required. Notably, by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.