Confidence-Aware Calibration and Scoring Functions for Curriculum Learning
Shuang Ao, Stefan Rueger, Advaith Siddharthan

TL;DR
This paper introduces confidence-aware label smoothing methods that incorporate model and human confidence scores to improve calibration and generalization in deep neural networks, especially within curriculum learning strategies.
Contribution
It proposes novel confidence-aware label smoothing techniques and demonstrates their effectiveness in enhancing model calibration and performance in curriculum learning.
Findings
Improved model calibration with confidence-aware label smoothing.
Enhanced performance in image and text classification tasks.
Better sample ranking for curriculum learning using confidence scores.
Abstract
Despite the great success of state-of-the-art deep neural networks, several studies have reported models to be over-confident in predictions, indicating miscalibration. Label Smoothing has been proposed as a solution to the over-confidence problem and works by softening hard targets during training, typically by distributing part of the probability mass from a `one-hot' label uniformly to all other labels. However, neither model nor human confidence in a label are likely to be uniformly distributed in this manner, with some labels more likely to be confused than others. In this paper we integrate notions of model confidence and human confidence with label smoothing, respectively \textit{Model Confidence LS} and \textit{Human Confidence LS}, to achieve better model calibration and generalization. To enhance model generalization, we show how our model and human confidence scores can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning
MethodsLabel Smoothing
