Out-of-Task Training for Dialog State Tracking Models

Michael Heck; Carel van Niekerk; Nurul Lubis; Christian Geishauser; Hsien-Chin Lin; Marco Moresi; Milica Gašić

arXiv:2011.09379 · cs.CL · November 19, 2020

TL;DR

This paper trains dialog state trackers on non-dialog data from unrelated NLP tasks, using the abundance of such corpora to mitigate the data sparsity inherent to dialog state tracking (DST).

Contribution

The paper shows that data from unrelated NLP tasks can be used to train dialog state trackers, so training is no longer restricted to dialog-specific datasets.

Findings

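01 Non-dialog data from unrelated NLP tasks can be used successfully to train dialog state trackers.

02 Within dialog, transfer learning and multi-task learning are limited by the amount of available data and by the specificity of dialog applications.

03 Unrelated NLP corpora offer an abundant resource for mitigating data sparsity in DST.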

Abstract

Dialog state tracking (DST) suffers from severe data sparsity. While many natural language processing (NLP) tasks benefit from transfer learning and multi-task learning, in dialog these methods are limited by the amount of available data and by the specificity of dialog applications. In this work, we successfully utilize non-dialog data from unrelated NLP tasks to train dialog state trackers. This opens the door to the abundance of unrelated NLP corpora to mitigate the data sparsity issue inherent to DST.
