Adaptive Label Smoothing with Self-Knowledge in Natural Language Generation
Dongkyu Lee, Ka Chun Cheung, Nevin L. Zhang

TL;DR
This paper introduces an adaptive label smoothing method that dynamically adjusts smoothing based on the model's own probability distribution, improving generalization and calibration in natural language generation tasks.
Contribution
It proposes a novel regularization scheme that varies label smoothing per instance using self-knowledge, supported by theoretical analysis and empirical validation.
Findings
Improved model calibration and robustness.
Enhanced generalization performance.
Effective in natural language generation tasks.
Abstract
Overconfidence has been shown to impair generalization and calibration of a neural network. Previous studies remedy this issue by adding a regularization term to a loss function, preventing a model from making a peaked distribution. Label smoothing smoothes target labels with a pre-defined prior label distribution; as a result, a model is learned to maximize the likelihood of predicting the soft label. Nonetheless, the amount of smoothing is the same in all samples and remains fixed in training. In other words, label smoothing does not reflect the change in probability distribution mapped by a model over the course of training. To address this issue, we propose a regularization scheme that brings dynamic nature into the smoothing parameter by taking model probability distribution into account, thereby varying the parameter per instance. A model in training self-regulates the extent of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Multimodal Machine Learning Applications · Advanced Neural Network Applications
MethodsKnowledge Distillation · Label Smoothing
