Instance-based Label Smoothing For Better Calibrated Classification   Networks

Mohamed Maher; Meelis Kull

arXiv:2110.05355·cs.LG·October 12, 2021

Instance-based Label Smoothing For Better Calibrated Classification Networks

Mohamed Maher, Meelis Kull

PDF

1 Repo

TL;DR

This paper introduces two novel instance-based label smoothing methods that improve calibration and generalization in neural networks by customizing smoothing per instance based on class similarity.

Contribution

It proposes new label smoothing techniques inspired by self-distillation, assigning non-uniform smoothing factors based on class similarity for better calibration.

Findings

01

Enhanced model calibration and generalization over standard label smoothing.

02

Effective across various neural architectures and image datasets.

Abstract

Label smoothing is widely used in deep neural networks for multi-class classification. While it enhances model generalization and reduces overconfidence by aiming to lower the probability for the predicted class, it distorts the predicted probabilities of other classes resulting in poor class-wise calibration. Another method for enhancing model generalization is self-distillation where the predictions of a teacher network trained with one-hot labels are used as the target for training a student network. We take inspiration from both label smoothing and self-distillation and propose two novel instance-based label smoothing approaches, where a teacher network trained with hard one-hot labels is used to determine the amount of per class smoothness applied to each instance. The assigned smoothing factor is non-uniformly distributed along with the classes according to their similarity with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mmaher22/instance-based-label-smoothing
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLabel Smoothing