Calibrating Where It Matters: Constrained Temperature Scaling
Stephen McKenna, Jacob Carse

TL;DR
This paper proposes a modified temperature scaling method for calibrating convolutional classifiers, focusing on regions of the probability space that influence decision-making, demonstrated on dermoscopy image classification.
Contribution
It introduces a calibration approach that emphasizes decision-critical regions, improving calibration effectiveness in practical diagnostic scenarios.
Findings
Enhanced calibration in decision-critical regions
Improved decision-making accuracy in dermoscopy classification
Focus on cost-sensitive calibration regions
Abstract
We consider calibration of convolutional classifiers for diagnostic decision making. Clinical decision makers can use calibrated classifiers to minimise expected costs given their own cost function. Such functions are usually unknown at training time. If minimising expected costs is the primary aim, algorithms should focus on tuning calibration in regions of probability simplex likely to effect decisions. We give an example, modifying temperature scaling calibration, and demonstrate improved calibration where it matters using convnets trained to classify dermoscopy images.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHeat Transfer and Optimization · Thermography and Photoacoustic Techniques
MethodsFocus
