A Simple and Scalable Kernel Density Approach for Reliable Uncertainty Quantification in Atomistic Machine Learning
Daniel Willimetz, Luk\'a\v{s} Grajciar

TL;DR
This paper introduces a GPU-accelerated, scalable kernel density estimation method in PCA-reduced space for reliable uncertainty quantification in atomistic machine learning, enhancing interpretability and robustness.
Contribution
It presents a transferable, model-agnostic uncertainty metric based on KDE that efficiently detects sparsely sampled regions without retraining models.
Findings
KDE-based scores reliably flag extrapolative configurations.
The method correlates well with ensemble-based uncertainties.
It highlights regions of reduced prediction trustworthiness.
Abstract
Machine learning models are increasingly used to predict material properties and accelerate atomistic simulations, but the reliability of their predictions depends on the representativeness of the training data. We present a scalable, GPU-accelerated uncertainty quantification framework based on -nearest-neighbor kernel density estimation (KDE) in a PCA-reduced descriptor space. This method efficiently detects sparsely sampled regions in large, high-dimensional datasets and provides a transferable, model-agnostic uncertainty metric without requiring retraining costly model ensembles. The framework is validated across diverse case studies varying in: i) chemistry, ii) prediction models (including foundational neural network), iii) descriptors used for KDE estimation, and iv) properties whose uncertainty is sought. In all cases, the KDE-based score reliably flags extrapolative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Electron and X-Ray Spectroscopy Techniques · Ion-surface interactions and analysis
