Diversifying Dialog Generation via Adaptive Label Smoothing
Yida Wang, Yinhe Zheng, Yong Jiang, Minlie Huang

TL;DR
This paper introduces AdaLabel, an adaptive label smoothing technique for neural dialogue models that enhances response diversity by dynamically adjusting target distributions based on context.
Contribution
The paper proposes a novel adaptive label smoothing method that estimates context-aware target distributions using a lightweight bi-directional decoder, improving dialogue response diversity.
Findings
Outperforms baseline models in response diversity
Effectively reduces over-confidence in dialogue generation
Achieves superior results on benchmark datasets
Abstract
Neural dialogue generation models trained with the one-hot target distribution suffer from the over-confidence issue, which leads to poor generation diversity as widely reported in the literature. Although existing approaches such as label smoothing can alleviate this issue, they fail to adapt to diverse dialog contexts. In this paper, we propose an Adaptive Label Smoothing (AdaLabel) approach that can adaptively estimate a target label distribution at each time step for different contexts. The maximum probability in the predicted distribution is used to modify the soft target distribution produced by a novel light-weight bi-directional decoder module. The resulting target distribution is aware of both previous and future contexts and is adjusted to avoid over-training the dialogue model. Our model can be trained in an end-to-end manner. Extensive experiments on two benchmark datasets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Multimodal Machine Learning Applications
MethodsAttentive Walk-Aggregating Graph Neural Network · Label Smoothing
