Personalizing a Dialogue System with Transfer Reinforcement Learning

Kaixiang Mo; Shuangyin Li; Yu Zhang; Jiajun Li; Qiang Yang

arXiv:1610.02891·cs.AI·May 29, 2017·23 cites

Personalizing a Dialogue System with Transfer Reinforcement Learning

Kaixiang Mo, Shuangyin Li, Yu Zhang, Jiajun Li, Qiang Yang

PDF

Open Access

TL;DR

This paper introduces PETAL, a transfer learning framework based on POMDPs, to personalize task-oriented dialogue systems by leveraging multi-user data and adapting to individual users, improving dialogue quality.

Contribution

It proposes a novel transfer learning approach for personalized dialogue systems using POMDPs, effectively addressing data scarcity and negative transfer issues.

Findings

01

Improved dialogue quality in personalized settings

02

Effective adaptation to individual user preferences

03

Successful application on real-world coffee-shopping data

Abstract

It is difficult to train a personalized task-oriented dialogue system because the data collected from each individual is often insufficient. Personalized dialogue systems trained on a small dataset can overfit and make it difficult to adapt to different user needs. One way to solve this problem is to consider a collection of multiple users' data as a source domain and an individual user's data as a target domain, and to perform a transfer learning from the source to the target domain. By following this idea, we propose "PETAL"(PErsonalized Task-oriented diALogue), a transfer-learning framework based on POMDP to learn a personalized dialogue system. The system first learns common dialogue knowledge from the source domain and then adapts this knowledge to the target user. This framework can avoid the negative transfer problem by considering differences between source and target users. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Topic Modeling · Speech Recognition and Synthesis