Improving Predictor Reliability with Selective Recalibration
Thomas P. Zollo, Zhun Deng, Jake C. Snell, Toniann Pitassi, Richard, Zemel

TL;DR
This paper introduces selective recalibration, a method that improves confidence calibration in deep learning models by selectively focusing on regions of the input space where recalibration is most effective, especially in complex tasks.
Contribution
The paper proposes a novel selective recalibration approach that learns to reject certain data points, enhancing calibration accuracy over traditional methods.
Findings
Significantly reduces calibration error across tasks.
Outperforms existing calibration baselines.
Effective in medical imaging and zero-shot classification.
Abstract
A reliable deep learning system should be able to accurately express its confidence with respect to its predictions, a quality known as calibration. One of the most effective ways to produce reliable confidence estimates with a pre-trained model is by applying a post-hoc recalibration method. Popular recalibration methods like temperature scaling are typically fit on a small amount of data and work in the model's output space, as opposed to the more expressive feature embedding space, and thus usually have only one or a handful of parameters. However, the target distribution to which they are applied is often complex and difficult to fit well with such a function. To this end we propose \textit{selective recalibration}, where a selection model learns to reject some user-chosen proportion of the data in order to allow the recalibrator to focus on regions of the input space that can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems
MethodsFocus
