Copy-Enhanced Heterogeneous Information Learning for Dialogue State Tracking
Qingbin Liu, Shizhu He, Kang Liu, Shengping Liu, and Jun Zhao

TL;DR
This paper introduces CEDST, a novel dialogue state tracking model that effectively integrates semantic information from ontology and dialogue texts, enabling accurate generation of known and unknown values, and achieves state-of-the-art results.
Contribution
The paper proposes a copy-enhanced heterogeneous information learning model with multi-encoder-decoder architecture for improved dialogue state tracking, especially in handling unknown values.
Findings
Achieves state-of-the-art results on two datasets.
Effectively generates unknown values by copying from heterogeneous texts.
Decomposes large state spaces into smaller, manageable ones.
Abstract
Dialogue state tracking (DST) is an essential component in task-oriented dialogue systems, which estimates user goals at every dialogue turn. However, most previous approaches usually suffer from the following problems. Many discriminative models, especially end-to-end (E2E) models, are difficult to extract unknown values that are not in the candidate ontology; previous generative models, which can extract unknown values from utterances, degrade the performance due to ignoring the semantic information of pre-defined ontology. Besides, previous generative models usually need a hand-crafted list to normalize the generated values. How to integrate the semantic information of pre-defined ontology and dialogue text (heterogeneous texts) to generate unknown values and improve performance becomes a severe challenge. In this paper, we propose a Copy-Enhanced Heterogeneous Information Learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Topic Modeling · Context-Aware Activity Recognition Systems
MethodsDynamic Sparse Training
