OmniDialog: An Omnipotent Pre-training Model for Task-Oriented Dialogue System
Mingtao Yang, See-Kiong Ng, Jinlan Fu

TL;DR
OmniDialog is a comprehensive pre-training model for task-oriented dialogue systems that unifies management, generation, and comprehension tasks to improve performance across multiple dialogue tasks.
Contribution
It introduces the first pre-trained dialogue model trained on management, generation, and comprehension tasks using multi-task learning across 7 tasks and 15 datasets.
Findings
Effective in low-resource scenarios
Excels on long dialogues and responses
Improves domain transfer learning
Abstract
Pre-trained conversation models (PCMs) have demonstrated remarkable results in task-oriented dialogue (TOD) systems. Many PCMs focus predominantly on dialogue management tasks like dialogue state tracking, dialogue generation tasks like response generation, or both. However, the existing PCMs seldom consider dialogue comprehension tasks, such as dialogue question answering and summarization tasks. These tasks allow PCMs to glean dialogue context from various angles. This observation naturally raises the question: Can the performance of downstream dialogue tasks be enhanced if a PCM is pre-trained on dialogue management, generation, and comprehension tasks? To investigate this, we proposed an Omnipotent Dialogue pre-training model (OmniDialog). It unifies these three dialogue tasks into a monolithic framework by multi-task learning, fostering inter-task communication. The pre-training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
MethodsFocus
