DSTEA: Improving Dialogue State Tracking via Entity Adaptive   Pre-training

Yukyung Lee; Takyoung Kim; Hoonsang Yoon; Pilsung Kang; Junseong Bang,; Misuk Kim

arXiv:2207.03858·cs.CL·July 25, 2023·1 cites

DSTEA: Improving Dialogue State Tracking via Entity Adaptive Pre-training

Yukyung Lee, Takyoung Kim, Hoonsang Yoon, Pilsung Kang, Junseong Bang,, Misuk Kim

PDF

Open Access

TL;DR

DSTEA introduces an entity adaptive pre-training method that enhances dialogue state tracking models by focusing on key entities, leading to significant accuracy improvements without requiring additional external knowledge.

Contribution

The paper presents DSTEA, a novel pre-training approach that improves DST by selectively training key entities, avoiding the need for external knowledge infusion.

Findings

01

Up to 2.69% improvement in joint goal accuracy on MultiWOZ datasets.

02

Effective identification of pivotal entities using four different methods.

03

Validation of pre-training strategies through extensive experiments.

Abstract

Dialogue State Tracking (DST) is critical for comprehensively interpreting user and system utterances, thereby forming the cornerstone of efficient dialogue systems. Despite past research efforts focused on enhancing DST performance through alterations to the model structure or integrating additional features like graph relations, they often require additional pre-training with external dialogue corpora. In this study, we propose DSTEA, improving Dialogue State Tracking via Entity Adaptive pre-training, which can enhance the encoder through by intensively training key entities in dialogue utterances. DSTEA identifies these pivotal entities from input dialogues utilizing four different methods: ontology information, named-entity recognition, the spaCy, and the flair library. Subsequently, it employs selective knowledge masking to train the model effectively. Remarkably, DSTEA only…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques

MethodsDynamic Sparse Training