Rethinking Early Stopping: Refine, Then Calibrate

Eug\`ene Berta; David Holzm\"uller; Michael I. Jordan; Francis Bach

arXiv:2501.19195·cs.LG·June 26, 2025

Rethinking Early Stopping: Refine, Then Calibrate

Eug\`ene Berta, David Holzm\"uller, Michael I. Jordan, Francis Bach

PDF

Open Access 4 Repos

TL;DR

This paper introduces a new perspective on calibration and refinement in probabilistic classifiers, proposing a two-stage training process that improves prediction quality by separately optimizing these components.

Contribution

It presents a variational formulation of calibration-refinement decomposition and a novel training method that separately minimizes refinement and calibration errors.

Findings

01

The proposed method improves calibration and refinement in classifiers.

02

Calibration and refinement errors are not minimized simultaneously during training.

03

Separately optimizing refinement and calibration yields better probabilistic predictions.

Abstract

Machine learning classifiers often produce probabilistic predictions that are critical for accurate and interpretable decision-making in various domains. The quality of these predictions is generally evaluated with proper losses, such as cross-entropy, which decompose into two components: calibration error assesses general under/overconfidence, while refinement error measures the ability to distinguish different classes. In this paper, we present a novel variational formulation of the calibration-refinement decomposition that sheds new light on post-hoc calibration, and enables rapid estimation of the different terms. Equipped with this new perspective, we provide theoretical and empirical evidence that calibration and refinement errors are not minimized simultaneously during training. Selecting the best epoch based on validation loss thus leads to a compromise point that is suboptimal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications

MethodsEarly Stopping