BlendX: Complex Multi-Intent Detection with Blended Patterns
Yejin Yoon, Jungyeon Lee, Kangsan Kim, Chanhee Park, Taeuk Kim

TL;DR
BlendX introduces more complex and diverse multi-intent detection datasets using rule-based heuristics and ChatGPT, revealing current models' limitations and prompting a reevaluation of MID approaches.
Contribution
The paper presents BlendX, a new suite of datasets with increased complexity and diversity for multi-intent detection, created with novel quality metrics and augmentation strategies.
Findings
State-of-the-art MID models perform poorly on BlendX datasets.
BlendX datasets exhibit greater diversity in intent patterns.
Current MID models need reexamination due to dataset challenges.
Abstract
Task-oriented dialogue (TOD) systems are commonly designed with the presumption that each utterance represents a single intent. However, this assumption may not accurately reflect real-world situations, where users frequently express multiple intents within a single utterance. While there is an emerging interest in multi-intent detection (MID), existing in-domain datasets such as MixATIS and MixSNIPS have limitations in their formulation. To address these issues, we present BlendX, a suite of refined datasets featuring more diverse patterns than their predecessors, elevating both its complexity and diversity. For dataset construction, we utilize both rule-based heuristics as well as a generative tool -- OpenAI's ChatGPT -- which is augmented with a similarity-driven strategy for utterance selection. To ensure the quality of the proposed datasets, we also introduce three novel metrics…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications
