Toward Scalable Neural Dialogue State Tracking Model
Elnaz Nouri, Ehsan Hosseini-Asl

TL;DR
This paper introduces a scalable neural dialogue state tracking model that significantly reduces latency while maintaining high accuracy, making it more suitable for real-world deployment.
Contribution
The proposed model simplifies the architecture by using a single recurrent network with global conditioning, reducing training and inference latency by 35% compared to previous models.
Findings
Reduces training and inference latency by 35% on average.
Maintains belief state tracking accuracy at 97.38% turn request and 88.51% joint goal.
Outperforms GLAD on Multi-WoZ dataset in turn inform and joint goal accuracy.
Abstract
The latency in the current neural based dialogue state tracking models prohibits them from being used efficiently for deployment in production systems, albeit their highly accurate performance. This paper proposes a new scalable and accurate neural dialogue state tracking model, based on the recently proposed Global-Local Self-Attention encoder (GLAD) model by Zhong et al. which uses global modules to share parameters between estimators for different types (called slots) of dialogue states, and uses local modules to learn slot-specific features. By using only one recurrent networks with global conditioning, compared to (1 + \# slots) recurrent networks with global and local conditioning used in the GLAD model, our proposed model reduces the latency in training and inference times by on average, while preserving performance of belief state tracking, by on turn request…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Machine Learning in Healthcare
