Goal-Oriented Chatbot Dialog Management Bootstrapping with Transfer Learning
Vladimir Ilievski, Claudiu Musat, Andreea Hossmann, Michael Baeriswyl

TL;DR
This paper presents a transfer learning approach to improve goal-oriented chatbot dialogue management, significantly enhancing success rates and training efficiency, especially in low-data domain scenarios.
Contribution
Introduces a transfer learning method that boosts dialogue policy performance and training speed in goal-oriented chatbots with limited domain data.
Findings
20% relative success rate improvement in distant domains
More than double success rate in close domains
Policy learning speed increased by 5 to 10 times
Abstract
Goal-Oriented (GO) Dialogue Systems, colloquially known as goal oriented chatbots, help users achieve a predefined goal (e.g. book a movie ticket) within a closed domain. A first step is to understand the user's goal by using natural language understanding techniques. Once the goal is known, the bot must manage a dialogue to achieve that goal, which is conducted with respect to a learnt policy. The success of the dialogue system depends on the quality of the policy, which is in turn reliant on the availability of high-quality training data for the policy learning method, for instance Deep Reinforcement Learning. Due to the domain specificity, the amount of available data is typically too low to allow the training of good dialogue policies. In this paper we introduce a transfer learning method to mitigate the effects of the low in-domain data availability. Our transfer learning based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
