Uncertainty-Aware Post-Hoc Calibration: Mitigating Confidently Incorrect Predictions Beyond Calibration Metrics

Hassan Gharoun; Mohammad Sadegh Khorshidi; Kasra Ranjbarigderi; Fang Chen; Amir H. Gandomi

arXiv:2510.17915·cs.LG·October 22, 2025

Uncertainty-Aware Post-Hoc Calibration: Mitigating Confidently Incorrect Predictions Beyond Calibration Metrics

Hassan Gharoun, Mohammad Sadegh Khorshidi, Kasra Ranjbarigderi, Fang Chen, Amir H. Gandomi

PDF

Open Access

TL;DR

This paper introduces a post-hoc calibration method that improves neural network confidence estimates and uncertainty-aware decision-making by stratifying predictions into correct and incorrect groups using conformal prediction, without retraining the model.

Contribution

It proposes a novel dual calibration framework that adaptively calibrates predictions based on their estimated correctness, enhancing calibration and uncertainty quantification without retraining.

Findings

01

Lower confidently incorrect predictions on CIFAR datasets

02

Competitive Expected Calibration Error compared to baselines

03

Effective instance-level calibration improving uncertainty estimates

Abstract

Despite extensive research on neural network calibration, existing methods typically apply global transformations that treat all predictions uniformly, overlooking the heterogeneous reliability of individual predictions. Furthermore, the relationship between improved calibration and effective uncertainty-aware decision-making remains largely unexplored. This paper presents a post-hoc calibration framework that leverages prediction reliability assessment to jointly enhance calibration quality and uncertainty-aware decision-making. The framework employs proximity-based conformal prediction to stratify calibration samples into putatively correct and putatively incorrect groups based on semantic similarity in feature space. A dual calibration strategy is then applied: standard isotonic regression calibrated confidence in putatively correct predictions, while underconfidence-regularized…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Machine Learning and Data Classification