SPACE-3: Unified Dialog Model Pre-training for Task-Oriented Dialog   Understanding and Generation

Wanwei He; Yinpei Dai; Min Yang; Jian Sun; Fei Huang; Luo Si; Yongbin; Li

arXiv:2209.06664·cs.CL·September 15, 2022·6 cites

SPACE-3: Unified Dialog Model Pre-training for Task-Oriented Dialog Understanding and Generation

Wanwei He, Yinpei Dai, Min Yang, Jian Sun, Fei Huang, Luo Si, Yongbin, Li

PDF

Open Access 1 Repo

TL;DR

SPACE-3 is a comprehensive unified pre-training framework for task-oriented dialog systems that effectively integrates understanding and generation, leading to state-of-the-art results across multiple benchmarks and enhanced few-shot capabilities.

Contribution

The paper introduces SPACE-3, a novel semi-supervised pre-trained dialog model with four integrated components and specialized pre-training objectives for each, enabling effective fine-tuning on diverse dialog tasks.

Findings

01

Achieves state-of-the-art performance on eight dialog benchmarks.

02

Demonstrates superior few-shot learning in low-resource settings.

03

Effectively combines understanding and generation in a unified model.

Abstract

Recently, pre-training methods have shown remarkable success in task-oriented dialog (TOD) systems. However, most existing pre-trained models for TOD focus on either dialog understanding or dialog generation, but not both. In this paper, we propose SPACE-3, a novel unified semi-supervised pre-trained conversation model learning from large-scale dialog corpora with limited annotations, which can be effectively fine-tuned on a wide range of downstream dialog tasks. Specifically, SPACE-3 consists of four successive components in a single transformer to maintain a task-flow in TOD systems: (i) a dialog encoding module to encode dialog history, (ii) a dialog understanding module to extract semantic vectors from either user queries or system responses, (iii) a dialog policy module to generate a policy vector that contains high-level semantics of the response, and (iv) a dialog generation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alibabaresearch/damo-convai
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · AI in Service Interactions

MethodsContrastive Learning