Data-Efficient Goal-Oriented Conversation with Dialogue Knowledge Transfer Networks
Igor Shalyminov, Sungjin Lee, Arash Eshghi, and Oliver Lemon

TL;DR
This paper introduces DiKTNet, a goal-oriented dialogue system that effectively learns from minimal data through a two-stage transfer learning approach, significantly reducing data requirements while maintaining high performance.
Contribution
The paper presents DiKTNet, a novel transfer learning framework that leverages unsupervised dialogue representation pre-training and multi-source knowledge integration for few-shot dialogue generation.
Findings
Outperforms baseline models and previous state-of-the-art by up to 10% in Entity F1.
Achieves 3% improvement in BLEU score with only 10% of the training data.
Demonstrates effective transfer learning in goal-oriented dialogue systems.
Abstract
Goal-oriented dialogue systems are now being widely adopted in industry where it is of key importance to maintain a rapid prototyping cycle for new products and domains. Data-driven dialogue system development has to be adapted to meet this requirement --- therefore, reducing the amount of data and annotations necessary for training such systems is a central research problem. In this paper, we present the Dialogue Knowledge Transfer Network (DiKTNet), a state-of-the-art approach to goal-oriented dialogue generation which only uses a few example dialogues (i.e. few-shot learning), none of which has to be annotated. We achieve this by performing a 2-stage training. Firstly, we perform unsupervised dialogue representation pre-training on a large source of goal-oriented dialogues in multiple domains, the MetaLWOz corpus. Secondly, at the transfer stage, we train DiKTNet using this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Bidirectional LSTM · Softmax · ELMo
