Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning
Xuxin Cheng, Wanshi Xu, Zhihong Zhu, Hongxiang Li, Yuexian Zou

TL;DR
This paper introduces a multi-level contrastive learning framework for spoken language understanding that enhances intent detection and slot filling by leveraging relationships at utterance, slot, and word levels, achieving state-of-the-art results.
Contribution
It proposes a novel multi-grained contrastive learning approach with self-distillation for improved SLU performance, addressing the mutual guidance between intent and slot subtasks.
Findings
Achieves 2.6% accuracy improvement on MixATIS dataset.
Demonstrates effectiveness of multi-level contrastive learning.
Outperforms previous state-of-the-art models.
Abstract
Spoken language understanding (SLU) is a core task in task-oriented dialogue systems, which aims at understanding the user's current goal through constructing semantic frames. SLU usually consists of two subtasks, including intent detection and slot filling. Although there are some SLU frameworks joint modeling the two subtasks and achieving high performance, most of them still overlook the inherent relationships between intents and slots and fail to achieve mutual guidance between the two subtasks. To solve the problem, we propose a multi-level multi-grained SLU framework MMCL to apply contrastive learning at three levels, including utterance level, slot level, and word level to enable intent and slot to mutually guide each other. For the utterance level, our framework implements coarse granularity contrastive learning and fine granularity contrastive learning simultaneously. Besides,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis
MethodsContrastive Learning
