Simple Calibration via Geodesic Kernels
Jayanta Dey, Haoyin Xu, Ashwin De Silva, Joshua T. Vogelstein

TL;DR
This paper introduces a novel calibration method using geodesic kernels that improves the reliability of deep models' confidence estimates across both in-distribution and out-of-distribution data, crucial for safety-critical applications.
Contribution
It proposes a new calibration approach leveraging geodesic distances and Gaussian kernels within the polytopic partitions learned by deep models, addressing both ID and OOD calibration challenges.
Findings
KDF and KDN achieve well-calibrated posteriors for ID and OOD data.
The methods preserve classification accuracy while extrapolating beyond training data.
Experiments on tabular and vision benchmarks validate effectiveness.
Abstract
Deep discriminative approaches, such as decision forests and deep neural networks, have recently found applications in many important real-world scenarios. However, deploying these learning algorithms in safety-critical applications raises concerns, particularly when it comes to ensuring calibration for both in-distribution and out-of-distribution regions. Many popular methods for in-distribution (ID) calibration, such as isotonic and Platt's sigmoidal regression, exhibit adequate ID calibration performance. However, these methods are not calibrated for the entire feature space, leading to overconfidence in the out-of-distribution (OOD) region. Existing OOD calibration methods generally exhibit poor ID calibration. In this paper, we jointly address the ID and OOD problems. We leveraged the fact that deep models learn to partition feature space into a union of polytopes, that is,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Neural Networks and Applications · Gaussian Processes and Bayesian Inference
