Beyond Overconfidence: Foundation Models Redefine Calibration in Deep Neural Networks

Achim Hekler; Lukas Kuhn; and Florian Buettner

arXiv:2506.09593·cs.LG·June 12, 2025

Beyond Overconfidence: Foundation Models Redefine Calibration in Deep Neural Networks

Achim Hekler, Lukas Kuhn, and Florian Buettner

PDF

Open Access

TL;DR

This paper investigates the calibration properties of foundation models like ConvNeXt, EVA, and BEiT, revealing their tendencies towards underconfidence in-distribution and improved calibration under distribution shifts, with implications for deployment safety.

Contribution

It provides the first comprehensive analysis of foundation models' calibration behavior, challenging assumptions of continuous calibration improvements and evaluating post-hoc calibration methods under various conditions.

Findings

01

Foundation models are underconfident in in-distribution predictions.

02

Calibration improves under distribution shifts.

03

Post-hoc calibration methods are less reliable under severe shifts.

Abstract

Reliable uncertainty calibration is essential for safely deploying deep neural networks in high-stakes applications. Deep neural networks are known to exhibit systematic overconfidence, especially under distribution shifts. Although foundation models such as ConvNeXt, EVA and BEiT have demonstrated significant improvements in predictive performance, their calibration properties remain underexplored. This paper presents a comprehensive investigation into the calibration behavior of foundation models, revealing insights that challenge established paradigms. Our empirical analysis shows that these models tend to be underconfident in in-distribution predictions, resulting in higher calibration errors, while demonstrating improved calibration under distribution shifts. Furthermore, we demonstrate that foundation models are highly responsive to post-hoc calibration techniques in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Advanced Neural Network Applications

MethodsConvNeXt