Guiding Generative Language Models for Data Augmentation in Few-Shot Text Classification
Aleksandra Edwards, Asahi Ushio, Jose Camacho-Collados, H\'el\`ene de, Ribaupierre, Alun Preece

TL;DR
This paper explores how guiding GPT-2 with domain expertise and seed selection strategies can enhance data augmentation for few-shot text classification, leading to improved model performance especially in specialized domains.
Contribution
It demonstrates that fine-tuning GPT-2 with selected seed examples and domain expert guidance improves data augmentation quality and classification accuracy in few-shot NLP tasks.
Findings
Fine-tuning GPT-2 improves classification performance.
Seed selection strategies impact sample quality.
Expert-guided augmentation yields further gains.
Abstract
Data augmentation techniques are widely used for enhancing the performance of machine learning models by tackling class imbalance issues and data sparsity. State-of-the-art generative language models have been shown to provide significant gains across different NLP tasks. However, their applicability to data augmentation for text classification tasks in few-shot settings have not been fully explored, especially for specialised domains. In this paper, we leverage GPT-2 (Radford A et al, 2019) for generating artificial training instances in order to improve classification performance. Our aim is to analyse the impact the selection process of seed training examples have over the quality of GPT-generated samples and consequently the classifier performance. We perform experiments with several seed selection strategies that, among others, exploit class hierarchical structures and domain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Cosine Annealing · Weight Decay · Attention Dropout · Adam · Dense Connections · Layer Normalization · Discriminative Fine-Tuning
