Improving Limited Labeled Dialogue State Tracking with Self-Supervision
Chien-Sheng Wu, Steven Hoi, Caiming Xiong

TL;DR
This paper introduces self-supervised learning techniques to improve dialogue state tracking with limited labeled data, achieving significant accuracy gains on the MultiWOZ dataset.
Contribution
It proposes two novel self-supervised objectives—latent consistency and conversational modeling—to enhance DST performance with minimal labeled data.
Findings
8.95% improvement in joint goal accuracy with 1% labeled data
Additional 1.76% gain using semi-supervised learning with unlabeled data
Self-supervised signals improve robustness and understanding in DST models
Abstract
Existing dialogue state tracking (DST) models require plenty of labeled data. However, collecting high-quality labels is costly, especially when the number of domains increases. In this paper, we address a practical DST problem that is rarely discussed, i.e., learning efficiently with limited labeled data. We present and investigate two self-supervised objectives: preserving latent consistency and modeling conversational behavior. We encourage a DST model to have consistent latent distributions given a perturbed input, making it more robust to an unseen scenario. We also add an auxiliary utterance generation task, modeling a potential correlation between conversational behavior and dialogue states. The experimental results show that our proposed self-supervised signals can improve joint goal accuracy by 8.95\% when only 1\% labeled data is used on the MultiWOZ dataset. We can achieve an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Topic Modeling · AI in Service Interactions
MethodsDynamic Sparse Training
