Generate then Refine: Data Augmentation for Zero-shot Intent Detection
I-Fan Lin, Faegheh Hasibi, Suzan Verberne

TL;DR
This paper introduces a two-stage data augmentation approach using a large language model and a smaller refiner to improve zero-shot intent detection in new domains, outperforming existing methods.
Contribution
It presents a novel two-step method combining zero-shot LLM generation and a refiner model to enhance intent detection data quality in unseen domains.
Findings
Refiner improves data diversity and utility over zero-shot baseline.
The method achieves better intent classification accuracy on real data.
Two-step approach outperforms common baseline data augmentation methods.
Abstract
In this short paper we propose a data augmentation method for intent detection in zero-resource domains. Existing data augmentation methods rely on few labelled examples for each intent category, which can be expensive in settings with many possible intents. We use a two-stage approach: First, we generate utterances for intent labels using an open-source large language model in a zero-shot setting. Second, we develop a smaller sequence-to-sequence model (the Refiner), to improve the generated utterances. The Refiner is fine-tuned on seen domains and then applied to unseen domains. We evaluate our method by training an intent classifier on the generated data, and evaluating it on real (human) data. We find that the Refiner significantly improves the data utility and diversity over the zero-shot LLM baseline for unseen domains and over common baseline approaches. Our results indicate that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMedical Imaging Techniques and Applications · Anomaly Detection Techniques and Applications · Radiation Detection and Scintillator Technologies
