Calibration by Distribution Matching: Trainable Kernel Calibration Metrics
Charles Marx, Sofian Zalouk, Stefano Ermon

TL;DR
This paper introduces trainable, kernel-based calibration metrics that unify and generalize calibration methods, enabling integrated optimization for better probabilistic forecasts in classification and regression tasks.
Contribution
It proposes differentiable, flexible calibration metrics that can be incorporated into training, improving calibration and decision-making over traditional post-hoc methods.
Findings
Enhanced calibration and sharpness in forecasts.
Improved decision-making performance.
Outperforms post-hoc recalibration methods.
Abstract
Calibration ensures that probabilistic forecasts meaningfully capture uncertainty by requiring that predicted probabilities align with empirical frequencies. However, many existing calibration methods are specialized for post-hoc recalibration, which can worsen the sharpness of forecasts. Drawing on the insight that calibration can be viewed as a distribution matching task, we introduce kernel-based calibration metrics that unify and generalize popular forms of calibration for both classification and regression. These metrics admit differentiable sample estimates, making it easy to incorporate a calibration objective into empirical risk minimization. Furthermore, we provide intuitive mechanisms to tailor calibration metrics to a decision task, and enforce accurate loss estimation and no regret decisions. Our empirical evaluation demonstrates that employing these metrics as regularizers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsForecasting Techniques and Applications · Time Series Analysis and Forecasting · Gaussian Processes and Bayesian Inference
MethodsALIGN
