An Explicit-Joint and Supervised-Contrastive Learning Framework for Few-Shot Intent Classification and Slot Filling
Han Liu, Feng Zhang, Xiaotong Zhang, Siyang Zhao, Xianchao Zhang

TL;DR
This paper introduces a novel explicit-joint and supervised-contrastive learning framework that enhances few-shot intent classification and slot filling by leveraging bidirectional interactions and flexible episode construction.
Contribution
It proposes a new joint learning framework combining explicit-joint and supervised contrastive learning for improved few-shot IC and SF performance.
Findings
Achieves promising results on three public datasets.
Effectively leverages bidirectional interactions for joint task reinforcement.
Handles unbalanced datasets with a flexible episode construction method.
Abstract
Intent classification (IC) and slot filling (SF) are critical building blocks in task-oriented dialogue systems. These two tasks are closely-related and can flourish each other. Since only a few utterances can be utilized for identifying fast-emerging new intents and slots, data scarcity issue often occurs when implementing IC and SF. However, few IC/SF models perform well when the number of training samples per class is quite small. In this paper, we propose a novel explicit-joint and supervised-contrastive learning framework for few-shot intent classification and slot filling. Its highlights are as follows. (i) The model extracts intent and slot representations via bidirectional interactions, and extends prototypical network to achieve explicit-joint learning, which guarantees that IC and SF tasks can mutually reinforce each other. (ii) The model integrates with supervised contrastive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Multimodal Machine Learning Applications
