When Active Learning Fails, Uncalibrated Out of Distribution Uncertainty Quantification Might Be the Problem

Ashley S. Dale; Kangming Li; Brian DeCost; Hao Wan; Yuchen Han; Yao Fehlis; Jason Hattrick-Simpers

arXiv:2511.17760·cond-mat.mtrl-sci·November 25, 2025

When Active Learning Fails, Uncalibrated Out of Distribution Uncertainty Quantification Might Be the Problem

Ashley S. Dale, Kangming Li, Brian DeCost, Hao Wan, Yuchen Han, Yao Fehlis, Jason Hattrick-Simpers

PDF

Open Access

TL;DR

This paper evaluates the effectiveness of various uncertainty estimation and calibration methods in active learning for materials discovery, highlighting challenges in out-of-distribution uncertainty quantification and its impact on reducing data needs.

Contribution

It systematically compares uncertainty calibration methods across different models and tasks, revealing limitations in out-of-distribution uncertainty estimates for active learning.

Findings

01

Calibrated uncertainties often fail to generalize out-of-domain.

02

Uncalibrated and random sampling sometimes outperform calibrated methods in out-of-distribution scenarios.

03

Intrinsic data and model limitations affect uncertainty quality regardless of model capacity.

Abstract

Efficiently and meaningfully estimating prediction uncertainty is important for exploration in active learning campaigns in materials discovery, where samples with high uncertainty are interpreted as containing information missing from the model. In this work, the effect of different uncertainty estimation and calibration methods are evaluated for active learning when using ensembles of ALIGNN, eXtreme Gradient Boost, Random Forest, and Neural Network model architectures. We compare uncertainty estimates from ALIGNN deep ensembles to loss landscape uncertainty estimates obtained for solubility, bandgap, and formation energy prediction tasks. We then evaluate how the quality of the uncertainty estimate impacts an active learning campaign that seeks model generalization to out-of-distribution data. Uncertainty calibration methods were found to variably generalize from in-domain data to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science · Machine Learning and Algorithms · Gaussian Processes and Bayesian Inference