TL;DR
This paper introduces a meta-learning approach for dialogue state tracking that significantly improves performance in low-data, zero-shot, and few-shot domain transfer scenarios, enhancing chatbot adaptability.
Contribution
The paper proposes a novel meta-learner D-REPTILE tailored for DST, demonstrating its effectiveness across multiple domains and models with substantial performance gains.
Findings
5-25% improvement over baseline in low-data settings
Meta-learner is model-agnostic and adaptable to various DST systems
Significant benefits shown across different domains and datasets
Abstract
Dialogue State Tracking (DST) forms a core component of automated chatbot based systems designed for specific goals like hotel, taxi reservation, tourist information, etc. With the increasing need to deploy such systems in new domains, solving the problem of zero/few-shot DST has become necessary. There has been a rising trend for learning to transfer knowledge from resource-rich domains to unknown domains with minimal need for additional data. In this work, we explore the merits of meta-learning algorithms for this transfer and hence, propose a meta-learner D-REPTILE specific to the DST problem. With extensive experimentation, we provide clear evidence of benefits over conventional approaches across different domains, methods, base models, and datasets with significant (5-25%) improvement over the baseline in a low-data setting. Our proposed meta-learner is agnostic of the underlying…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDynamic Sparse Training
