Intent Detection in the Age of LLMs

Gaurav Arora; Shreya Jain; Srujana Merugu

arXiv:2410.01627·cs.CL·October 3, 2024·2 cites

Intent Detection in the Age of LLMs

Gaurav Arora, Shreya Jain, Srujana Merugu

PDF

Open Access

TL;DR

This paper explores how large language models can be adapted for intent detection in dialogue systems, comparing their performance with traditional models, and proposing hybrid approaches to improve accuracy and reduce latency.

Contribution

It introduces a hybrid system combining LLMs and sentence transformers with uncertainty routing, and demonstrates improved OOS detection using internal LLM representations.

Findings

01

Hybrid system achieves near-native LLM accuracy with 50% less latency.

02

Internal LLM representations improve OOS detection accuracy by over 5%.

03

Performance is influenced by intent label scope and label space size.

Abstract

Intent detection is a critical component of task-oriented dialogue systems (TODS) which enables the identification of suitable actions to address user utterances at each dialog turn. Traditional approaches relied on computationally efficient supervised sentence transformer encoder models, which require substantial training data and struggle with out-of-scope (OOS) detection. The emergence of generative large language models (LLMs) with intrinsic world knowledge presents new opportunities to address these challenges. In this work, we adapt 7 SOTA LLMs using adaptive in-context learning and chain-of-thought prompting for intent detection, and compare their performance with contrastively fine-tuned sentence transformer (SetFit) models to highlight prediction quality and latency tradeoff. We propose a hybrid system using uncertainty based routing strategy to combine the two approaches that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques