Beyond Overconfidence: Foundation Models Redefine Calibration in Deep Neural Networks
Achim Hekler, Lukas Kuhn, and Florian Buettner

TL;DR
This paper investigates the calibration properties of foundation models like ConvNeXt, EVA, and BEiT, revealing their tendencies towards underconfidence in-distribution and improved calibration under distribution shifts, with implications for deployment safety.
Contribution
It provides the first comprehensive analysis of foundation models' calibration behavior, challenging assumptions of continuous calibration improvements and evaluating post-hoc calibration methods under various conditions.
Findings
Foundation models are underconfident in in-distribution predictions.
Calibration improves under distribution shifts.
Post-hoc calibration methods are less reliable under severe shifts.
Abstract
Reliable uncertainty calibration is essential for safely deploying deep neural networks in high-stakes applications. Deep neural networks are known to exhibit systematic overconfidence, especially under distribution shifts. Although foundation models such as ConvNeXt, EVA and BEiT have demonstrated significant improvements in predictive performance, their calibration properties remain underexplored. This paper presents a comprehensive investigation into the calibration behavior of foundation models, revealing insights that challenge established paradigms. Our empirical analysis shows that these models tend to be underconfident in in-distribution predictions, resulting in higher calibration errors, while demonstrating improved calibration under distribution shifts. Furthermore, we demonstrate that foundation models are highly responsive to post-hoc calibration techniques in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Advanced Neural Network Applications
MethodsConvNeXt
