Few-Shot Generalization Across Dialogue Tasks

Vladimir Vlasov; Akela Drissner-Schmid; Alan Nichol

arXiv:1811.11707·cs.CL·November 29, 2018·31 cites

Few-Shot Generalization Across Dialogue Tasks

Vladimir Vlasov, Akela Drissner-Schmid, Alan Nichol

PDF

Open Access 2 Repos

TL;DR

This paper introduces REDP, a novel dialogue policy embedding system with memory and attention, demonstrating superior few-shot generalization across dialogue tasks and achieving perfect accuracy on the bAbI benchmark.

Contribution

The paper presents REDP, a new recurrent embedding dialogue policy with memory and attention, enabling better transfer and few-shot learning in dialogue management.

Findings

01

REDP outperforms baseline LSTM on dialogue tasks.

02

Both REDP and baseline achieve 100% accuracy on bAbI dialogue.

03

REDP effectively generalizes to new dialogue domains.

Abstract

Machine-learning based dialogue managers are able to learn complex behaviors in order to complete a task, but it is not straightforward to extend their capabilities to new domains. We investigate different policies' ability to handle uncooperative user behavior, and how well expertise in completing one task (such as restaurant reservations) can be reapplied when learning a new one (e.g. booking a hotel). We introduce the Recurrent Embedding Dialogue Policy (REDP), which embeds system actions and dialogue states in the same vector space. REDP contains a memory component and attention mechanism based on a modified Neural Turing Machine, and significantly outperforms a baseline LSTM classifier on this task. We also show that both our architecture and baseline solve the bAbI dialogue task, achieving 100% test accuracy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Topic Modeling · Context-Aware Activity Recognition Systems

MethodsSoftmax · Sigmoid Activation · Tanh Activation · Location-based Attention · Long Short-Term Memory · Content-based Attention · Neural Turing Machine