Multi-Intent Spoken Language Understanding: Methods, Trends, and Challenges

Di Wu; Ruiyu Fang; Liting Jiang; Shuangyong Song; Xiaomeng Huang; Shiquan Wang; Zhongqiu Li; Lingling Shi; Mengjiao Bao; Yongxiang Li; Hao Huang

arXiv:2512.11258·cs.CL·December 15, 2025

Multi-Intent Spoken Language Understanding: Methods, Trends, and Challenges

Di Wu, Ruiyu Fang, Liting Jiang, Shuangyong Song, Xiaomeng Huang, Shiquan Wang, Zhongqiu Li, Lingling Shi, Mengjiao Bao, Yongxiang Li, Hao Huang

PDF

Open Access

TL;DR

This survey reviews recent advances in multi-intent spoken language understanding, focusing on decoding paradigms and modeling approaches, analyzing their strengths and limitations, and discussing future research challenges.

Contribution

It provides a comprehensive systematic review of existing studies on multi-intent SLU, highlighting recent progress, model comparisons, and future research directions.

Findings

01

Performance varies across different models and approaches.

02

Decoding paradigms significantly impact SLU effectiveness.

03

Current challenges include handling complex multi-intent utterances.

Abstract

Multi-intent spoken language understanding (SLU) involves two tasks: multiple intent detection and slot filling, which jointly handle utterances containing more than one intent. Owing to this characteristic, which closely reflects real-world applications, the task has attracted increasing research attention, and substantial progress has been achieved. However, there remains a lack of a comprehensive and systematic review of existing studies on multi-intent SLU. To this end, this paper presents a survey of recent advances in multi-intent SLU. We provide an in-depth overview of previous research from two perspectives: decoding paradigms and modeling approaches. On this basis, we further compare the performance of representative models and analyze their strengths and limitations. Finally, we discuss the current challenges and outline promising directions for future research. We hope this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Topic Modeling · Multimodal Machine Learning Applications