An Empirical Survey of Data Augmentation for Limited Data Learning in NLP

Jiaao Chen; Derek Tam; Colin Raffel; Mohit Bansal; Diyi Yang

arXiv:2106.07499 · cs.CL · June 15, 2021 · 21 cites

TL;DR

This paper provides a comprehensive empirical overview of data augmentation techniques in NLP for limited labeled data scenarios, evaluating various methods across multiple tasks and datasets to guide practitioners.

Contribution

It systematically surveys recent data augmentation methods in NLP for low-resource settings and evaluates their effectiveness across diverse tasks and datasets.

Findings

01 Token-level augmentations often improve classification accuracy.

02 Adversarial augmentations enhance model robustness.

03 Effectiveness of augmentation methods varies by task and dataset.
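
To make the first finding concrete, here is a minimal sketch of what token-level augmentation can look like. It uses two common perturbations, random token deletion and random token swap, and is not taken from the paper's code. The function names (`random_delete`, `random_swap`, `augment`), the whitespace tokenizer, and the default rates are all assumptions chosen for illustration.

```python
import random


def random_delete(tokens, p, rng):
    """Drop each token independently with probability p.

    Always returns at least one token so the example is never empty.
    """
    if len(tokens) <= 1:
        return list(tokens)
    kept = [t for t in tokens if rng.random() >= p]
    return kept if kept else [rng.choice(tokens)]


def random_swap(tokens, n_swaps, rng):
    """Swap two randomly chosen positions, repeated n_swaps times."""
    out = list(tokens)
    if len(out) < 2:
        return out
    for _ in range(n_swaps):
        i, j = rng.sample(range(len(out)), 2)
        out[i], out[j] = out[j], out[i]
    return out


def augment(sentence, n_copies=4, p_delete=0.1, n_swaps=1, seed=0):
    """Return n_copies perturbed variants of a sentence.

    Hypothetical helper: alternates deletion and swap, and splits on
    whitespace (an assumption; real pipelines use a proper tokenizer).
    """
    rng = random.Random(seed)
    tokens = sentence.split()
    copies = []
    for k in range(n_copies):
        if k % 2 == 0:
            new_tokens = random_delete(tokens, p_delete, rng)
        else:
            new_tokens = random_swap(tokens, n_swaps, rng)
        copies.append(" ".join(new_tokens))
    return copies


if __name__ == "__main__":
    for variant in augment("the movie was surprisingly good despite its length"):
        print(variant)
```

Each variant keeps the label of the original sentence, so a small labeled set can be expanded cheaply. Whether that helps depends on the task and dataset, as the third finding notes.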

Abstract

NLP has achieved great progress in the past decade through the use of neural models and large labeled datasets. The dependence on abundant data prevents NLP models from being applied to low-resource settings or novel tasks where significant time, money, or expertise is required to label massive amounts of textual data. Recently, data augmentation methods have been explored as a means of improving data efficiency in NLP. To date, there has been no systematic empirical overview of data augmentation for NLP in the limited labeled data setting, making it difficult to understand which methods work in which settings. In this paper, we provide an empirical survey of recent progress on data augmentation for NLP in the limited labeled data setting, summarizing the landscape of methods (including token-level augmentations, sentence-level augmentations, adversarial augmentations, and hidden-space…

Code & Models

Models

🤗 Jed612/BERT-ED · model · 1 dl

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications