Robust Dialogue State Tracking with Weak Supervision and Sparse Data

Michael Heck; Nurul Lubis; Carel van Niekerk; Shutong Feng; Christian Geishauser; Hsien-Chin Lin; Milica Gašić

arXiv:2202.03354·cs.CL·August 10, 2022

Open Access

TL;DR

This paper introduces a robust extractive dialogue state tracking approach that operates without manual span labels, using novel dropout methods and a unified encoder to enhance performance on sparse data and new topics.

Contribution

It presents a new training strategy and model architecture that eliminate the need for fine-grained supervision in dialogue state tracking.

Findings

01 Achieves state-of-the-art results on multiple benchmarks.

02 Demonstrates robustness to sample sparsity and new concepts.

03 Effectively learns from non-dialogue data.

Abstract

Generalising dialogue state tracking (DST) to new data is especially challenging due to the strong reliance on abundant and fine-grained supervision during training. Sample sparsity, distributional shift and the occurrence of new concepts and topics frequently lead to severe performance degradation during inference. In this paper we propose a training strategy to build extractive DST models without the need for fine-grained manual span labels. Two novel input-level dropout methods mitigate the negative impact of sample sparsity. We propose a new model architecture with a unified encoder that supports value as well as slot independence by leveraging the attention mechanism. We combine the strengths of triple copy strategy DST and value matching to benefit from complementary predictions without violating the principle of ontology independence. Our experiments demonstrate that an…
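
The abstract mentions input-level dropout but the visible text does not describe either of the two methods. As a rough illustration of the general idea only, the sketch below randomly replaces input tokens with an unknown-token placeholder before they reach the encoder. The function name `token_dropout`, the `[UNK]` placeholder, the protected special tokens, and the rate are all assumptions made for this example and are not taken from the paper.

```python
import random
from typing import FrozenSet, List, Optional


def token_dropout(
    tokens: List[str],
    rate: float,
    unk_token: str = "[UNK]",  # assumed placeholder, not from the paper
    protected: FrozenSet[str] = frozenset({"[CLS]", "[SEP]"}),  # assumed
    rng: Optional[random.Random] = None,
) -> List[str]:
    """Replace each non-protected token with `unk_token` with probability `rate`.

    Generic input-level dropout sketch: the sequence length is unchanged,
    so any position-based labels stay aligned with the tokens.
    """
    if not 0.0 <= rate <= 1.0:
        raise ValueError("rate must be in [0, 1]")
    if rng is None:
        rng = random.Random()
    return [
        unk_token if tok not in protected and rng.random() < rate else tok
        for tok in tokens
    ]


if __name__ == "__main__":
    turn = ["[CLS]", "i", "need", "a", "taxi", "to", "the", "station", "[SEP]"]
    # Apply during training only; at inference the input is left untouched.
    print(token_dropout(turn, rate=0.3, rng=random.Random(13)))
```

Masking at the input level, rather than on hidden activations, forces a model to rely on the surrounding context instead of memorising specific surface forms, which is one plausible reason such a scheme could help when training samples are sparse.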

Taxonomy

Topics: Speech and dialogue systems · Topic Modeling · Context-Aware Activity Recognition Systems

Methods: Dynamic Sparse Training · Ontology · Dropout