Task-Oriented Dialog Systems that Consider Multiple Appropriate Responses under the Same Context
Yichi Zhang, Zhijian Ou, Zhou Yu

TL;DR
This paper introduces a novel data augmentation framework for task-oriented dialog systems that leverages the one-to-many property of conversations to generate diverse, appropriate responses, improving policy diversity and response quality.
Contribution
The proposed Multi-Action Data Augmentation (MADA) framework explicitly models multiple valid responses for each dialog state, enhancing diversity and performance in task-oriented dialog systems.
Findings
Improves dialog policy diversity and response appropriateness
Achieves state-of-the-art results on MultiWOZ dataset
Enhances training with augmented state-action pairs
Abstract
Conversations have an intrinsic one-to-many property, which means that multiple responses can be appropriate for the same dialog context. In task-oriented dialogs, this property leads to different valid dialog policies towards task completion. However, none of the existing task-oriented dialog generation approaches takes this property into account. We propose a Multi-Action Data Augmentation (MADA) framework to utilize the one-to-many property to generate diverse appropriate dialog responses. Specifically, we first use dialog states to summarize the dialog history, and then discover all possible mappings from every dialog state to its different valid system actions. During dialog system training, we enable the current dialog state to map to all valid system actions discovered in the previous process to create additional state-action pairs. By incorporating these additional pairs, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
