MetaASSIST: Robust Dialogue State Tracking with Meta Learning
Fanghua Ye, Xi Wang, Jie Huang, Shenghui Li, Samuel Stern, Emine, Yilmaz

TL;DR
MetaASSIST introduces a meta-learning framework to adaptively tune pseudo-label weighting in dialogue state tracking, significantly improving robustness and achieving state-of-the-art accuracy on MultiWOZ 2.4.
Contribution
It proposes a novel meta-learning approach to dynamically learn weighting parameters for pseudo-labels in DST, addressing limitations of fixed parameters.
Findings
Achieves a joint goal accuracy of 80.10% on MultiWOZ 2.4.
Demonstrates competitive performance with flexible weighting schemes.
Outperforms previous methods in robustness to noisy annotations.
Abstract
Existing dialogue datasets contain lots of noise in their state annotations. Such noise can hurt model training and ultimately lead to poor generalization performance. A general framework named ASSIST has recently been proposed to train robust dialogue state tracking (DST) models. It introduces an auxiliary model to generate pseudo labels for the noisy training set. These pseudo labels are combined with vanilla labels by a common fixed weighting parameter to train the primary DST model. Notwithstanding the improvements of ASSIST on DST, tuning the weighting parameter is challenging. Moreover, a single parameter shared by all slots and all instances may be suboptimal. To overcome these limitations, we propose a meta learning-based framework MetaASSIST to adaptively learn the weighting parameter. Specifically, we propose three schemes with varying degrees of flexibility, ranging from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Topic Modeling · Context-Aware Activity Recognition Systems
MethodsDynamic Sparse Training
