Improving Predictor Reliability with Selective Recalibration

Thomas P. Zollo; Zhun Deng; Jake C. Snell; Toniann Pitassi; Richard; Zemel

arXiv:2410.05407·cs.LG·October 10, 2024

Improving Predictor Reliability with Selective Recalibration

Thomas P. Zollo, Zhun Deng, Jake C. Snell, Toniann Pitassi, Richard, Zemel

PDF

Open Access

TL;DR

This paper introduces selective recalibration, a method that improves confidence calibration in deep learning models by selectively focusing on regions of the input space where recalibration is most effective, especially in complex tasks.

Contribution

The paper proposes a novel selective recalibration approach that learns to reject certain data points, enhancing calibration accuracy over traditional methods.

Findings

01

Significantly reduces calibration error across tasks.

02

Outperforms existing calibration baselines.

03

Effective in medical imaging and zero-shot classification.

Abstract

A reliable deep learning system should be able to accurately express its confidence with respect to its predictions, a quality known as calibration. One of the most effective ways to produce reliable confidence estimates with a pre-trained model is by applying a post-hoc recalibration method. Popular recalibration methods like temperature scaling are typically fit on a small amount of data and work in the model's output space, as opposed to the more expressive feature embedding space, and thus usually have only one or a handful of parameters. However, the target distribution to which they are applied is often complex and difficult to fit well with such a function. To this end we propose \textit{selective recalibration}, where a selection model learns to reject some user-chosen proportion of the data in order to allow the recalibrator to focus on regions of the input space that can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems

MethodsFocus