Adaptive Bridge between Training and Inference for Dialogue
Haoran Xu, Hainan Zhang, Yanyan Zou, Hongshen Chen, Zhuoye Ding,, Yanyan Lan

TL;DR
This paper introduces an adaptive switching mechanism to mitigate exposure bias in dialogue response generation, improving response diversity and relevance by balancing ground-truth and generated learning.
Contribution
It proposes a novel adaptive mechanism that dynamically transitions between ground-truth and generated responses based on word-level matching scores, addressing exposure bias in dialogue models.
Findings
Significant improvement in metric-based evaluation metrics.
Enhanced human evaluation scores for response quality.
Effective also in Neural Machine Translation tasks.
Abstract
Although exposure bias has been widely studied in some NLP tasks, it faces its unique challenges in dialogue response generation, the representative one-to-various generation scenario. In real human dialogue, there are many appropriate responses for the same context, not only with different expressions, but also with different topics. Therefore, due to the much bigger gap between various ground-truth responses and the generated synthetic response, exposure bias is more challenging in dialogue generation task. What's more, as MLE encourages the model to only learn the common words among different ground-truth responses, but ignores the interesting and specific parts, exposure bias may further lead to the common response generation problem, such as "I don't know" and "HaHa?" In this paper, we propose a novel adaptive switching mechanism, which learns to automatically transit between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
