Two Sides of Miscalibration: Identifying Over and Under-Confidence   Prediction for Network Calibration

Shuang Ao; Stefan Rueger; Advaith Siddharthan

arXiv:2308.03172·cs.LG·August 8, 2023·2 cites

Two Sides of Miscalibration: Identifying Over and Under-Confidence Prediction for Network Calibration

Shuang Ao, Stefan Rueger, Advaith Siddharthan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new metric to identify over and under-confidence in neural network calibration, revealing limitations of existing methods and proposing a technique that improves calibration and failure detection.

Contribution

The paper presents a novel miscalibration score to detect both over and under-confidence, and a calibration method that addresses these issues, outperforming existing techniques.

Findings

01

The miscalibration score effectively identifies calibration issues.

02

The proposed calibration method outperforms existing techniques.

03

Improved failure detection and trustworthiness in neural networks.

Abstract

Proper confidence calibration of deep neural networks is essential for reliable predictions in safety-critical tasks. Miscalibration can lead to model over-confidence and/or under-confidence; i.e., the model's confidence in its prediction can be greater or less than the model's accuracy. Recent studies have highlighted the over-confidence issue by introducing calibration techniques and demonstrated success on various tasks. However, miscalibration through under-confidence has not yet to receive much attention. In this paper, we address the necessity of paying attention to the under-confidence issue. We first introduce a novel metric, a miscalibration score, to identify the overall and class-wise calibration status, including being over or under-confident. Our proposed metric reveals the pitfalls of existing calibration techniques, where they often overly calibrate the model and worsen…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aoshuang92/miscalibration_ts
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI)