Personalizing a Dialogue System with Transfer Reinforcement Learning
Kaixiang Mo, Shuangyin Li, Yu Zhang, Jiajun Li, Qiang Yang

TL;DR
This paper introduces PETAL, a transfer learning framework based on POMDPs, to personalize task-oriented dialogue systems by leveraging multi-user data and adapting to individual users, improving dialogue quality.
Contribution
It proposes a novel transfer learning approach for personalized dialogue systems using POMDPs, effectively addressing data scarcity and negative transfer issues.
Findings
Improved dialogue quality in personalized settings
Effective adaptation to individual user preferences
Successful application on real-world coffee-shopping data
Abstract
It is difficult to train a personalized task-oriented dialogue system because the data collected from each individual is often insufficient. Personalized dialogue systems trained on a small dataset can overfit and make it difficult to adapt to different user needs. One way to solve this problem is to consider a collection of multiple users' data as a source domain and an individual user's data as a target domain, and to perform a transfer learning from the source to the target domain. By following this idea, we propose "PETAL"(PErsonalized Task-oriented diALogue), a transfer-learning framework based on POMDP to learn a personalized dialogue system. The system first learns common dialogue knowledge from the source domain and then adapts this knowledge to the target user. This framework can avoid the negative transfer problem by considering differences between source and target users. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Topic Modeling · Speech Recognition and Synthesis
