Structured Matrix Scaling for Multi-Class Calibration

Eug\`ene Berta; David Holzm\"uller; Michael I. Jordan; Francis Bach

arXiv:2511.03685·cs.LG·March 11, 2026

Structured Matrix Scaling for Multi-Class Calibration

Eug\`ene Berta, David Holzm\"uller, Michael I. Jordan, Francis Bach

PDF

Open Access

TL;DR

This paper introduces structured matrix scaling methods for multi-class calibration that improve probability estimates of classifiers by managing bias-variance tradeoffs with regularization and optimization, outperforming existing techniques.

Contribution

It proposes a novel structured matrix scaling approach for multi-class calibration, addressing overfitting issues and providing efficient, open-source implementations.

Findings

01

Structured regularization improves calibration accuracy.

02

Enhanced calibration methods outperform existing temperature and matrix scaling.

03

Effective bias-variance tradeoff management leads to substantial gains.

Abstract

Post-hoc recalibration methods are widely used to ensure that classifiers provide faithful probability estimates. We argue that parametric recalibration functions based on logistic regression can be motivated from a simple theoretical setting for both binary and multiclass classification. This insight motivates the use of more expressive calibration methods beyond standard temperature scaling. For multi-class calibration however, a key challenge lies in the increasing number of parameters introduced by more complex models, often coupled with limited calibration data, which can lead to overfitting. Through extensive experiments, we demonstrate that the resulting bias-variance tradeoff can be effectively managed by structured regularization, robust preprocessing and efficient optimization. The resulting methods lead to substantial gains over existing logistic-based calibration techniques.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImbalanced Data Classification Techniques · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications