Few Shot Dialogue State Tracking using Meta-learning

Saket Dingliwal; Bill Gao; Sanchit Agarwal; Chien-Wei Lin; Tagyoung; Chung; Dilek Hakkani-Tur

arXiv:2101.06779·cs.CL·April 6, 2021

Few Shot Dialogue State Tracking using Meta-learning

Saket Dingliwal, Bill Gao, Sanchit Agarwal, Chien-Wei Lin, Tagyoung, Chung, Dilek Hakkani-Tur

PDF

1 Repo

TL;DR

This paper introduces a meta-learning approach for dialogue state tracking that significantly improves performance in low-data, zero-shot, and few-shot domain transfer scenarios, enhancing chatbot adaptability.

Contribution

The paper proposes a novel meta-learner D-REPTILE tailored for DST, demonstrating its effectiveness across multiple domains and models with substantial performance gains.

Findings

01

5-25% improvement over baseline in low-data settings

02

Meta-learner is model-agnostic and adaptable to various DST systems

03

Significant benefits shown across different domains and datasets

Abstract

Dialogue State Tracking (DST) forms a core component of automated chatbot based systems designed for specific goals like hotel, taxi reservation, tourist information, etc. With the increasing need to deploy such systems in new domains, solving the problem of zero/few-shot DST has become necessary. There has been a rising trend for learning to transfer knowledge from resource-rich domains to unknown domains with minimal need for additional data. In this work, we explore the merits of meta-learning algorithms for this transfer and hence, propose a meta-learner D-REPTILE specific to the DST problem. With extensive experimentation, we provide clear evidence of benefits over conventional approaches across different domains, methods, base models, and datasets with significant (5-25%) improvement over the baseline in a low-data setting. Our proposed meta-learner is agnostic of the underlying…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

saketdingliwal/Few-Shot-DST
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDynamic Sparse Training