Few-Shot Text Classification with Triplet Networks, Data Augmentation,   and Curriculum Learning

Jason Wei; Chengyu Huang; Soroush Vosoughi; Yu Cheng; Shiqi Xu

arXiv:2103.07552·cs.CL·June 16, 2021

Few-Shot Text Classification with Triplet Networks, Data Augmentation, and Curriculum Learning

Jason Wei, Chengyu Huang, Soroush Vosoughi, Yu Cheng, Shiqi Xu

PDF

1 Repo

TL;DR

This paper enhances few-shot text classification by combining triplet networks with data augmentation and curriculum learning, leading to faster training and improved accuracy on multiple tasks.

Contribution

It introduces curriculum data augmentation, a novel training strategy that effectively integrates augmented data to boost few-shot text classification performance.

Findings

01

Data augmentation improves triplet network accuracy by up to 3%.

02

Curriculum data augmentation accelerates training and enhances robustness.

03

Two-stage and gradual schedules outperform single-stage training.

Abstract

Few-shot text classification is a fundamental NLP task in which a model aims to classify text into a large number of categories, given only a few training examples per category. This paper explores data augmentation -- a technique particularly suitable for training with limited data -- for this few-shot, highly-multiclass text classification setting. On four diverse text classification tasks, we find that common data augmentation techniques can improve the performance of triplet networks by up to 3.0% on average. To further boost performance, we present a simple training strategy called curriculum data augmentation, which leverages curriculum learning by first training on only original examples and then introducing augmented data as training progresses. We explore a two-stage and a gradual schedule, and find that, compared with standard single-stage training, curriculum data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jasonwei20/triplet-loss
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.