Building Advanced Dialogue Managers for Goal-Oriented Dialogue Systems
Vladimir Ilievski

TL;DR
This paper presents a transfer learning approach for goal-oriented dialogue systems that significantly improves success rates and training efficiency, especially in low-data domain scenarios.
Contribution
It introduces a transfer learning method that enhances dialogue policy learning, outperforming non-transfer models in success rate and training speed, especially for closely related domains.
Findings
20% relative success rate improvement in distant domains
More than double success rate in close domains
5 to 10 times faster policy learning
Abstract
Goal-Oriented (GO) Dialogue Systems, colloquially known as goal oriented chatbots, help users achieve a predefined goal (e.g. book a movie ticket) within a closed domain. A first step is to understand the user's goal by using natural language understanding techniques. Once the goal is known, the bot must manage a dialogue to achieve that goal, which is conducted with respect to a learnt policy. The success of the dialogue system depends on the quality of the policy, which is in turn reliant on the availability of high-quality training data for the policy learning method, for instance Deep Reinforcement Learning. Due to the domain specificity, the amount of available data is typically too low to allow the training of good dialogue policies. In this master thesis we introduce a transfer learning method to mitigate the effects of the low in-domain data availability. Our transfer learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Topic Modeling · AI in Service Interactions
