Heterogeneous Ensemble Enables a Universal Uncertainty Metric for Atomistic Foundation Models
Kai Liu, Zixiong Wei, Wei Gao, Poulumi Dey, Marcel H.F. Sluiter, Fei Shuang

TL;DR
This paper introduces a scalable uncertainty metric for atomistic foundation models using a heterogeneous ensemble, enabling safer deployment, improved accuracy, and cost-effective system-specific potential development.
Contribution
It presents a novel, unified uncertainty metric based on heterogeneous ensembles that correlates with prediction errors and enhances model reliability and efficiency.
Findings
Uncertainty metric correlates strongly with true prediction errors.
System-specific potentials achieve near-DFT accuracy with minimal labels.
Filtering label noise can improve model accuracy beyond original labels.
Abstract
Universal machine learning interatomic potentials (uMLIPs) are reshaping atomistic simulation as foundation models, delivering near \textit{ab initio} accuracy at a fraction of the cost. Yet the lack of reliable, general uncertainty quantification limits their safe, wide-scale use. Here we introduce a unified, scalable uncertainty metric \(U\) based on a heterogeneous model ensemble with reuse of pretrained uMLIPs. Across chemically and structurally diverse datasets, \(U\) shows a strong correlation with the true prediction errors and provides a robust ranking of configuration-level risk. Leveraging this metric, we propose an uncertainty-aware model distillation framework to produce system-specific potentials: for W, an accuracy comparable to full-DFT training is achieved using only \(4\%\) of the DFT labels; for MoNbTaW, no additional DFT calculations are required. Notably, by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
