Data-Efficient Goal-Oriented Conversation with Dialogue Knowledge   Transfer Networks

Igor Shalyminov; Sungjin Lee; Arash Eshghi; and Oliver Lemon

arXiv:1910.01302·cs.CL·October 4, 2019

Data-Efficient Goal-Oriented Conversation with Dialogue Knowledge Transfer Networks

Igor Shalyminov, Sungjin Lee, Arash Eshghi, and Oliver Lemon

PDF

Open Access

TL;DR

This paper introduces DiKTNet, a goal-oriented dialogue system that effectively learns from minimal data through a two-stage transfer learning approach, significantly reducing data requirements while maintaining high performance.

Contribution

The paper presents DiKTNet, a novel transfer learning framework that leverages unsupervised dialogue representation pre-training and multi-source knowledge integration for few-shot dialogue generation.

Findings

01

Outperforms baseline models and previous state-of-the-art by up to 10% in Entity F1.

02

Achieves 3% improvement in BLEU score with only 10% of the training data.

03

Demonstrates effective transfer learning in goal-oriented dialogue systems.

Abstract

Goal-oriented dialogue systems are now being widely adopted in industry where it is of key importance to maintain a rapid prototyping cycle for new products and domains. Data-driven dialogue system development has to be adapted to meet this requirement --- therefore, reducing the amount of data and annotations necessary for training such systems is a central research problem. In this paper, we present the Dialogue Knowledge Transfer Network (DiKTNet), a state-of-the-art approach to goal-oriented dialogue generation which only uses a few example dialogues (i.e. few-shot learning), none of which has to be annotated. We achieve this by performing a 2-stage training. Firstly, we perform unsupervised dialogue representation pre-training on a large source of goal-oriented dialogues in multiple domains, the MetaLWOz corpus. Secondly, at the transfer stage, we train DiKTNet using this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Bidirectional LSTM · Softmax · ELMo