Multiclass Local Calibration with the Jensen-Shannon Distance
Cesare Barbera, Lorenzo Perini, Giovanni De Toni, Andrea Passerini, Andrea Pugnana

TL;DR
This paper introduces a local perspective on multiclass calibration, addressing proximity bias by enforcing alignment between predicted probabilities and local class frequencies using Jensen-Shannon distance, with theoretical and empirical validation.
Contribution
It formally defines multiclass local calibration, analyzes evaluation metrics, and proposes a neural network calibration method based on Jensen-Shannon distance.
Findings
The proposed method improves local calibration accuracy.
Existing metrics can be misleading for local calibration evaluation.
Empirical results show better calibration performance than existing techniques.
Abstract
Developing trustworthy Machine Learning (ML) models requires their predicted probabilities to be well-calibrated, meaning they should reflect true-class frequencies. Among calibration notions in multiclass classification, strong calibration is the most stringent, as it requires all predicted probabilities to be simultaneously calibrated across all classes. However, existing approaches to multiclass calibration lack a notion of distance among inputs, which makes them vulnerable to proximity bias: predictions in sparse regions of the feature space are systematically miscalibrated. In this work, we address this main shortcoming by introducing a local perspective on multiclass calibration. First, we formally define multiclass local calibration and establish its relationship with strong calibration. Second, we theoretically analyze the pitfalls of existing evaluation metrics when applied to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
