A Generative User Simulator with GPT-based Architecture and Goal State   Tracking for Reinforced Multi-Domain Dialog Systems

Hong Liu; Yucheng Cai; Zhijian Ou; Yi Huang; Junlan Feng

arXiv:2210.08692·cs.CL·October 19, 2022

A Generative User Simulator with GPT-based Architecture and Goal State Tracking for Reinforced Multi-Domain Dialog Systems

Hong Liu, Yucheng Cai, Zhijian Ou, Yi Huang, Junlan Feng

PDF

Open Access 1 Repo

TL;DR

This paper introduces a GPT-2 based generative user simulator with goal state tracking for multi-domain dialog systems, demonstrating superior performance over traditional methods in various evaluations.

Contribution

It presents a novel GPT-2 based user simulator with integrated goal tracking, enabling effective multi-domain dialog system training and evaluation.

Findings

01

GUS outperforms classic agenda-based simulators in multiple evaluations.

02

The GPT-2 based architecture effectively captures user behavior in dialog.

03

End-to-end trainability enhances multi-domain dialog system development.

Abstract

Building user simulators (USs) for reinforcement learning (RL) of task-oriented dialog systems (DSs) has gained more and more attention, which, however, still faces several fundamental challenges. First, it is unclear whether we can leverage pretrained language models to design, for example, GPT-2 based USs, to catch up and interact with the recently advanced GPT-2 based DSs. Second, an important ingredient in a US is that the user goal can be effectively incorporated and tracked; but how to flexibly integrate goal state tracking and develop an end-to-end trainable US for multi-domains has remained to be a challenge. In this work, we propose a generative user simulator (GUS) with GPT-2 based architecture and goal state tracking towards addressing the above two challenges. Extensive experiments are conducted on MultiWOZ2.1. Different DSs are trained via RL with GUS, the classic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thu-spmi/gus
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Topic Modeling · AI in Service Interactions

MethodsMulti-Head Attention · Attention Is All You Need · Byte Pair Encoding · Layer Normalization · Cosine Annealing · Residual Connection · Dropout · Dense Connections · Discriminative Fine-Tuning · Weight Decay