Comparative study of ensemble-based uncertainty quantification methods for neural network interatomic potentials
Yonatan Kurniawan (1), Mingjian Wen (2), Ellad B. Tadmor (3), Mark K. Transtrum (1) ((1) Department of Physics, Astronomy, Brigham Young University, Provo, Utah, USA, (2) Institute of Fundamental, Frontier Sciences, University of Electronic Science, Technology of China, Chengdu

TL;DR
This study evaluates ensemble-based uncertainty quantification methods for neural network interatomic potentials, revealing limitations in their reliability for system-level predictions, especially in out-of-distribution scenarios.
Contribution
It systematically compares multiple ensemble-based methods for uncertainty quantification in neural network potentials and assesses their effectiveness in both in-distribution and out-of-distribution regimes.
Findings
Uncertainty estimates can plateau or decrease as errors increase in OOD settings.
Current methods have fundamental limitations in reliably indicating accuracy for extrapolative predictions.
Caution is advised when using predictive uncertainty as a proxy for accuracy in large-scale simulations.
Abstract
Machine learning interatomic potentials (MLIPs) enable atomistic simulations with near first-principles accuracy at substantially reduced computational cost, making them powerful tools for large-scale materials modeling. The accuracy of MLIPs is typically validated on a held-out dataset of \emph{ab initio} energies and atomic forces. However, accuracy on these small-scale properties does not guarantee reliability for emergent, system-level behavior -- precisely the regime where atomistic simulations are most needed, but for which direct validation is often computationally prohibitive. As a practical heuristic, predictive precision -- quantified as inverse uncertainty -- is commonly used as a proxy for accuracy, but its reliability remains poorly understood, particularly for system-level predictions. In this work, we systematically assess the relationship between predictive precision and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems
