Zero-shot Slot Filling in the Age of LLMs for Dialogue Systems
Mansi Rana, Kadri Hacioglu, Sindhuja Gopalan, Maragathamani, Boothalingam

TL;DR
This paper tackles zero-shot slot filling in dialogue systems by proposing data annotation and knowledge distillation techniques, achieving significant improvements in accuracy and efficiency over existing models in conversational settings.
Contribution
It introduces novel strategies for automatic data annotation and knowledge distillation tailored for dialogue, enhancing zero-shot slot filling performance in conversational AI.
Findings
26% absolute F1 score improvement over vanilla LLMs
34% relative F1 score increase in call center system
Near real-time inference with higher accuracy and low latency
Abstract
Zero-shot slot filling is a well-established subtask of Natural Language Understanding (NLU). However, most existing methods primarily focus on single-turn text data, overlooking the unique complexities of conversational dialogue. Conversational data is highly dynamic, often involving abrupt topic shifts, interruptions, and implicit references that make it difficult to directly apply zero-shot slot filling techniques, even with the remarkable capabilities of large language models (LLMs). This paper addresses these challenges by proposing strategies for automatic data annotation with slot induction and black-box knowledge distillation (KD) from a teacher LLM to a smaller model, outperforming vanilla LLMs on internal datasets by 26% absolute increase in F1 score. Additionally, we introduce an efficient system architecture for call center product settings that surpasses off-the-shelf…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Robotics and Automated Systems · Mobile Agent-Based Network Management
MethodsKnowledge Distillation · Focus
