SPACE-3: Unified Dialog Model Pre-training for Task-Oriented Dialog Understanding and Generation
Wanwei He, Yinpei Dai, Min Yang, Jian Sun, Fei Huang, Luo Si, Yongbin, Li

TL;DR
SPACE-3 is a comprehensive unified pre-training framework for task-oriented dialog systems that effectively integrates understanding and generation, leading to state-of-the-art results across multiple benchmarks and enhanced few-shot capabilities.
Contribution
The paper introduces SPACE-3, a novel semi-supervised pre-trained dialog model with four integrated components and specialized pre-training objectives for each, enabling effective fine-tuning on diverse dialog tasks.
Findings
Achieves state-of-the-art performance on eight dialog benchmarks.
Demonstrates superior few-shot learning in low-resource settings.
Effectively combines understanding and generation in a unified model.
Abstract
Recently, pre-training methods have shown remarkable success in task-oriented dialog (TOD) systems. However, most existing pre-trained models for TOD focus on either dialog understanding or dialog generation, but not both. In this paper, we propose SPACE-3, a novel unified semi-supervised pre-trained conversation model learning from large-scale dialog corpora with limited annotations, which can be effectively fine-tuned on a wide range of downstream dialog tasks. Specifically, SPACE-3 consists of four successive components in a single transformer to maintain a task-flow in TOD systems: (i) a dialog encoding module to encode dialog history, (ii) a dialog understanding module to extract semantic vectors from either user queries or system responses, (iii) a dialog policy module to generate a policy vector that contains high-level semantics of the response, and (iv) a dialog generation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · AI in Service Interactions
MethodsContrastive Learning
