A Generative User Simulator with GPT-based Architecture and Goal State Tracking for Reinforced Multi-Domain Dialog Systems
Hong Liu, Yucheng Cai, Zhijian Ou, Yi Huang, Junlan Feng

TL;DR
This paper introduces a GPT-2 based generative user simulator with goal state tracking for multi-domain dialog systems, demonstrating superior performance over traditional methods in various evaluations.
Contribution
It presents a novel GPT-2 based user simulator with integrated goal tracking, enabling effective multi-domain dialog system training and evaluation.
Findings
GUS outperforms classic agenda-based simulators in multiple evaluations.
The GPT-2 based architecture effectively captures user behavior in dialog.
End-to-end trainability enhances multi-domain dialog system development.
Abstract
Building user simulators (USs) for reinforcement learning (RL) of task-oriented dialog systems (DSs) has gained more and more attention, which, however, still faces several fundamental challenges. First, it is unclear whether we can leverage pretrained language models to design, for example, GPT-2 based USs, to catch up and interact with the recently advanced GPT-2 based DSs. Second, an important ingredient in a US is that the user goal can be effectively incorporated and tracked; but how to flexibly integrate goal state tracking and develop an end-to-end trainable US for multi-domains has remained to be a challenge. In this work, we propose a generative user simulator (GUS) with GPT-2 based architecture and goal state tracking towards addressing the above two challenges. Extensive experiments are conducted on MultiWOZ2.1. Different DSs are trained via RL with GUS, the classic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Topic Modeling · AI in Service Interactions
MethodsMulti-Head Attention · Attention Is All You Need · Byte Pair Encoding · Layer Normalization · Cosine Annealing · Residual Connection · Dropout · Dense Connections · Discriminative Fine-Tuning · Weight Decay
