How Many Data Samples is an Additional Instruction Worth?

Ravsehaj Singh Puri; Swaroop Mishra; Mihir Parmar; Chitta Baral

arXiv:2203.09161·cs.CL·February 15, 2023

How Many Data Samples is an Additional Instruction Worth?

Ravsehaj Singh Puri, Swaroop Mishra, Mihir Parmar, Chitta Baral

PDF

1 Repo

TL;DR

This paper investigates whether adding alternate instructions can improve instruction-tuned NLP models, finding that such augmentation can be as effective as adding around 200 data samples, especially in low-data scenarios.

Contribution

The study demonstrates that instruction augmentation with alternate instructions significantly enhances model performance, offering a practical way for non-experts to improve NLP tasks without extensive data collection.

Findings

01

Instruction augmentation improves performance by up to 35%.

02

Additional instructions can be equivalent to approximately 200 data samples.

03

Effectiveness is especially pronounced in low-data regimes.

Abstract

Recently introduced instruction-paradigm empowers non-expert users to leverage NLP resources by defining a new task in natural language. Instruction-tuned models have significantly outperformed multitask learning models (without instruction); however they are far from state-of-the-art task-specific models. Conventional approaches to improve model performance via creating datasets with large number of task instances or architectural changes in the model may not be feasible for non-expert users. However, they can write alternate instructions to represent an instruction task. Is Instruction-augmentation helpful? We augment a subset of tasks in the expanded version of NATURAL INSTRUCTIONS with additional instructions and find that it significantly improves model performance (up to 35%), especially in the low-data regime. Our results indicate that an additional instruction can be equivalent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ravsehajsinghpuri/multi-variant-instructions
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.