GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with   Semi-Supervised Learning and Explicit Policy Injection

Wanwei He; Yinpei Dai; Yinhe Zheng; Yuchuan Wu; Zheng Cao; Dermot Liu,; Peng Jiang; Min Yang; Fei Huang; Luo Si; Jian Sun; Yongbin Li

arXiv:2111.14592·cs.CL·March 30, 2022·45 cites

GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning and Explicit Policy Injection

Wanwei He, Yinpei Dai, Yinhe Zheng, Yuchuan Wu, Zheng Cao, Dermot Liu,, Peng Jiang, Min Yang, Fei Huang, Luo Si, Jian Sun, Yongbin Li

PDF

Open Access 1 Repo 1 Video

TL;DR

GALAXY is a pre-trained dialog model that explicitly learns dialog policy using semi-supervised learning, improving task-oriented dialog performance and few-shot capabilities on benchmark datasets.

Contribution

It introduces a novel semi-supervised pre-training approach with explicit policy learning and regularization, achieving state-of-the-art results in task-oriented dialog systems.

Findings

01

Significantly improves performance on benchmark datasets

02

Achieves new state-of-the-art scores

03

Demonstrates strong few-shot learning ability

Abstract

Pre-trained models have proved to be powerful in enhancing task-oriented dialog systems. However, current pre-training methods mainly focus on enhancing dialog understanding and generation tasks while neglecting the exploitation of dialog policy. In this paper, we propose GALAXY, a novel pre-trained dialog model that explicitly learns dialog policy from limited labeled dialogs and large-scale unlabeled dialog corpora via semi-supervised learning. Specifically, we introduce a dialog act prediction task for policy optimization during pre-training and employ a consistency regularization term to refine the learned representation with the help of unlabeled dialogs. We also implement a gating mechanism to weigh suitable unlabeled dialog samples. Empirical results show that GALAXY substantially improves the performance of task-oriented dialog systems, and achieves new state-of-the-art results…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

siat-nlp/galaxy
pytorchOfficial

Videos

GALAXY: A Generative Pre-Trained Model for Task-Oriented Dialog with Semi-Supervised Learning and Explicit Policy Injection· underline

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · AI in Service Interactions