Out-of-Task Training for Dialog State Tracking Models
Michael Heck, Carel van Niekerk, Nurul Lubis, Christian Geishauser,, Hsien-Chin Lin, Marco Moresi, Milica Ga\v{s}i\'c

TL;DR
This paper introduces a novel approach to improve dialog state tracking by leveraging non-dialog NLP data, addressing data sparsity issues and enhancing model training.
Contribution
It demonstrates the effective use of unrelated NLP data for training dialog state trackers, expanding beyond traditional dialog-specific datasets.
Findings
Non-dialog NLP data improves DST performance.
Transfer learning mitigates data sparsity in dialog tasks.
Method enhances generalization of DST models.
Abstract
Dialog state tracking (DST) suffers from severe data sparsity. While many natural language processing (NLP) tasks benefit from transfer learning and multi-task learning, in dialog these methods are limited by the amount of available data and by the specificity of dialog applications. In this work, we successfully utilize non-dialog data from unrelated NLP tasks to train dialog state trackers. This opens the door to the abundance of unrelated NLP corpora to mitigate the data sparsity issue inherent to DST.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDynamic Sparse Training
