PromptDA: Label-guided Data Augmentation for Prompt-based Few-shot   Learners

Canyu Chen; Kai Shu

arXiv:2205.09229·cs.CL·March 24, 2023

PromptDA: Label-guided Data Augmentation for Prompt-based Few-shot Learners

Canyu Chen, Kai Shu

PDF

Open Access 1 Repo

TL;DR

PromptDA is a novel data augmentation framework that leverages label semantics to improve prompt-based few-shot learning in natural language understanding tasks, outperforming traditional methods.

Contribution

It introduces a label-guided data augmentation method that enhances prompt-based few-shot learning by exploiting label semantic information.

Findings

01

Significant performance improvements on few-shot text classification tasks.

02

Effective utilization of label semantics enhances data augmentation.

03

Outperforms existing prompt-based tuning methods in low-resource scenarios.

Abstract

Recent advances in large pre-trained language models (PLMs) lead to impressive gains in natural language understanding (NLU) tasks with task-specific fine-tuning. However, directly fine-tuning PLMs heavily relies on sufficient labeled training instances, which are usually hard to obtain. Prompt-based tuning on PLMs has shown to be powerful for various downstream few-shot tasks. Existing works studying prompt-based tuning for few-shot NLU tasks mainly focus on deriving proper label words with a verbalizer or generating prompt templates to elicit semantics from PLMs. In addition, conventional data augmentation strategies such as synonym substitution, though widely adopted in low-resource scenarios, only bring marginal improvements for prompt-based few-shot learning. Thus, an important research question arises: how to design effective data augmentation methods for prompt-based few-shot…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

canyuchen/promptda
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications