Comparative study of ensemble-based uncertainty quantification methods for neural network interatomic potentials

Yonatan Kurniawan (1); Mingjian Wen (2); Ellad B. Tadmor (3); Mark K. Transtrum (1) ((1) Department of Physics; Astronomy; Brigham Young University; Provo; Utah; USA; (2) Institute of Fundamental; Frontier Sciences; University of Electronic Science; Technology of China; Chengdu; China; (3) Department of Aerospace Engineering; Mechanics; University of Minnesota; Minneapolis; Minnesota; USA)

arXiv:2508.06456·cond-mat.mtrl-sci·August 11, 2025

Comparative study of ensemble-based uncertainty quantification methods for neural network interatomic potentials

Yonatan Kurniawan (1), Mingjian Wen (2), Ellad B. Tadmor (3), Mark K. Transtrum (1) ((1) Department of Physics, Astronomy, Brigham Young University, Provo, Utah, USA, (2) Institute of Fundamental, Frontier Sciences, University of Electronic Science, Technology of China, Chengdu

PDF

Open Access

TL;DR

This study evaluates ensemble-based uncertainty quantification methods for neural network interatomic potentials, revealing limitations in their reliability for system-level predictions, especially in out-of-distribution scenarios.

Contribution

It systematically compares multiple ensemble-based methods for uncertainty quantification in neural network potentials and assesses their effectiveness in both in-distribution and out-of-distribution regimes.

Findings

01

Uncertainty estimates can plateau or decrease as errors increase in OOD settings.

02

Current methods have fundamental limitations in reliably indicating accuracy for extrapolative predictions.

03

Caution is advised when using predictive uncertainty as a proxy for accuracy in large-scale simulations.

Abstract

Machine learning interatomic potentials (MLIPs) enable atomistic simulations with near first-principles accuracy at substantially reduced computational cost, making them powerful tools for large-scale materials modeling. The accuracy of MLIPs is typically validated on a held-out dataset of \emph{ab initio} energies and atomic forces. However, accuracy on these small-scale properties does not guarantee reliability for emergent, system-level behavior -- precisely the regime where atomistic simulations are most needed, but for which direct validation is often computationally prohibitive. As a practical heuristic, predictive precision -- quantified as inverse uncertainty -- is commonly used as a proxy for accuracy, but its reliability remains poorly understood, particularly for system-level predictions. In this work, we systematically assess the relationship between predictive precision and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems