DSTEA: Improving Dialogue State Tracking via Entity Adaptive Pre-training
Yukyung Lee, Takyoung Kim, Hoonsang Yoon, Pilsung Kang, Junseong Bang,, Misuk Kim

TL;DR
DSTEA introduces an entity adaptive pre-training method that enhances dialogue state tracking models by focusing on key entities, leading to significant accuracy improvements without requiring additional external knowledge.
Contribution
The paper presents DSTEA, a novel pre-training approach that improves DST by selectively training key entities, avoiding the need for external knowledge infusion.
Findings
Up to 2.69% improvement in joint goal accuracy on MultiWOZ datasets.
Effective identification of pivotal entities using four different methods.
Validation of pre-training strategies through extensive experiments.
Abstract
Dialogue State Tracking (DST) is critical for comprehensively interpreting user and system utterances, thereby forming the cornerstone of efficient dialogue systems. Despite past research efforts focused on enhancing DST performance through alterations to the model structure or integrating additional features like graph relations, they often require additional pre-training with external dialogue corpora. In this study, we propose DSTEA, improving Dialogue State Tracking via Entity Adaptive pre-training, which can enhance the encoder through by intensively training key entities in dialogue utterances. DSTEA identifies these pivotal entities from input dialogues utilizing four different methods: ontology information, named-entity recognition, the spaCy, and the flair library. Subsequently, it employs selective knowledge masking to train the model effectively. Remarkably, DSTEA only…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
MethodsDynamic Sparse Training
