Selective In-Context Data Augmentation for Intent Detection using   Pointwise V-Information

Yen-Ting Lin; Alexandros Papangelis; Seokhwan Kim; Sungjin Lee,; Devamanyu Hazarika; Mahdi Namazifar; Di Jin; Yang Liu; Dilek Hakkani-Tur

arXiv:2302.05096·cs.CL·February 13, 2023·1 cites

Selective In-Context Data Augmentation for Intent Detection using Pointwise V-Information

Yen-Ting Lin, Alexandros Papangelis, Seokhwan Kim, Sungjin Lee,, Devamanyu Hazarika, Mahdi Namazifar, Di Jin, Yang Liu, Dilek Hakkani-Tur

PDF

Open Access

TL;DR

This paper introduces a novel data augmentation method for intent detection that uses large pre-trained language models and pointwise V-information to generate and filter synthetic data, improving performance especially in few-shot scenarios.

Contribution

The paper proposes a new approach combining PLMs and PVI for intent data augmentation, with intent-aware filtering to enhance classifier training.

Findings

01

Achieves state-of-the-art results in few-shot intent detection.

02

Performs comparably to full-data models in full-shot settings.

03

Demonstrates effective synthetic data generation and filtering.

Abstract

This work focuses on in-context data augmentation for intent detection. Having found that augmentation via in-context prompting of large pre-trained language models (PLMs) alone does not improve performance, we introduce a novel approach based on PLMs and pointwise V-information (PVI), a metric that can measure the usefulness of a datapoint for training a model. Our method first fine-tunes a PLM on a small seed of training data and then synthesizes new datapoints - utterances that correspond to given intents. It then employs intent-aware filtering, based on PVI, to remove datapoints that are not helpful to the downstream intent classifier. Our method is thus able to leverage the expressive power of large language models to produce diverse training data. Empirical results demonstrate that our method can produce synthetic training data that achieve state-of-the-art performance on three…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Anomaly Detection Techniques and Applications