GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning and Explicit Policy Injection
Wanwei He, Yinpei Dai, Yinhe Zheng, Yuchuan Wu, Zheng Cao, Dermot Liu,, Peng Jiang, Min Yang, Fei Huang, Luo Si, Jian Sun, Yongbin Li

TL;DR
GALAXY is a pre-trained dialog model that explicitly learns dialog policy using semi-supervised learning, improving task-oriented dialog performance and few-shot capabilities on benchmark datasets.
Contribution
It introduces a novel semi-supervised pre-training approach with explicit policy learning and regularization, achieving state-of-the-art results in task-oriented dialog systems.
Findings
Significantly improves performance on benchmark datasets
Achieves new state-of-the-art scores
Demonstrates strong few-shot learning ability
Abstract
Pre-trained models have proved to be powerful in enhancing task-oriented dialog systems. However, current pre-training methods mainly focus on enhancing dialog understanding and generation tasks while neglecting the exploitation of dialog policy. In this paper, we propose GALAXY, a novel pre-trained dialog model that explicitly learns dialog policy from limited labeled dialogs and large-scale unlabeled dialog corpora via semi-supervised learning. Specifically, we introduce a dialog act prediction task for policy optimization during pre-training and employ a consistency regularization term to refine the learned representation with the help of unlabeled dialogs. We also implement a gating mechanism to weigh suitable unlabeled dialog samples. Empirical results show that GALAXY substantially improves the performance of task-oriented dialog systems, and achieves new state-of-the-art results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · AI in Service Interactions
