Evaluation of uncertainty estimations for Gaussian process regression based machine learning interatomic potentials
Matthias Holzenkamp, Dongyu Lyu, Ulrich Kleinekath\"ofer, Peter Zaspel

TL;DR
This paper evaluates the effectiveness of uncertainty estimations in Gaussian process regression-based machine learning interatomic potentials, highlighting their calibration issues and implications for active learning and model performance.
Contribution
It provides a comprehensive assessment of GPR uncertainty measures, revealing their calibration limitations and impact on active learning strategies in molecular potential predictions.
Findings
Ensemble uncertainties show poor global calibration.
GPR standard deviation has good calibration but systematic bias at high uncertainties.
High GPR uncertainty correlates with higher bias and error, but does not quantify potential error accurately.
Abstract
Uncertainty estimations for machine learning interatomic potentials (MLIPs) are crucial for quantifying model error and identifying informative training samples in active learning strategies. In this study, we evaluate uncertainty estimations of Gaussian process regression (GPR)-based MLIPs, including the predictive GPR standard deviation and ensemble-based uncertainties. We do this in terms of calibration and in terms of impact on model performance in an active learning scheme. We consider GPR models with Coulomb and Smooth Overlap of Atomic Positions (SOAP) representations as inputs to predict potential energy surfaces and excitation energies of molecules. Regarding calibration, we find that ensemble-based uncertainty estimations show already poor global calibration (e.g., averaged over the whole test set). In contrast, the GPR standard deviation shows good global calibration, but…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Fault Detection and Control Systems · Gaussian Processes and Bayesian Inference
MethodsGaussian Process
