Adaptive Bridge between Training and Inference for Dialogue

Haoran Xu; Hainan Zhang; Yanyan Zou; Hongshen Chen; Zhuoye Ding,; Yanyan Lan

arXiv:2110.11560·cs.CL·October 25, 2021

Adaptive Bridge between Training and Inference for Dialogue

Haoran Xu, Hainan Zhang, Yanyan Zou, Hongshen Chen, Zhuoye Ding,, Yanyan Lan

PDF

Open Access

TL;DR

This paper introduces an adaptive switching mechanism to mitigate exposure bias in dialogue response generation, improving response diversity and relevance by balancing ground-truth and generated learning.

Contribution

It proposes a novel adaptive mechanism that dynamically transitions between ground-truth and generated responses based on word-level matching scores, addressing exposure bias in dialogue models.

Findings

01

Significant improvement in metric-based evaluation metrics.

02

Enhanced human evaluation scores for response quality.

03

Effective also in Neural Machine Translation tasks.

Abstract

Although exposure bias has been widely studied in some NLP tasks, it faces its unique challenges in dialogue response generation, the representative one-to-various generation scenario. In real human dialogue, there are many appropriate responses for the same context, not only with different expressions, but also with different topics. Therefore, due to the much bigger gap between various ground-truth responses and the generated synthetic response, exposure bias is more challenging in dialogue generation task. What's more, as MLE encourages the model to only learn the common words among different ground-truth responses, but ignores the interesting and specific parts, exposure bias may further lead to the common response generation problem, such as "I don't know" and "HaHa?" In this paper, we propose a novel adaptive switching mechanism, which learns to automatically transit between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications