Cross-Task Generalization via Natural Language Crowdsourcing Instructions
Swaroop Mishra, Daniel Khashabi, Chitta Baral, Hannaneh Hajishirzi

TL;DR
This paper introduces NATURAL INSTRUCTIONS, a large dataset of diverse tasks and instructions, to evaluate and improve models' ability to generalize across tasks by understanding human-readable instructions.
Contribution
It presents a new dataset of 61 tasks with instructions and evaluates cross-task generalization using pre-trained language models, highlighting the importance of instructions for transfer learning.
Findings
Models benefit from instructions with 19% improved generalization to unseen tasks.
Current models lag behind an estimated performance upperbound.
Instructions significantly enhance cross-task transfer capabilities.
Abstract
Humans (e.g., crowdworkers) have a remarkable ability in solving different tasks, by simply reading textual instructions that define them and looking at a few examples. Despite the success of the conventional supervised learning on individual datasets, such models often struggle with generalization across tasks (e.g., a question-answering system cannot solve classification tasks). A long-standing challenge in AI is to build a model that learns a new task by understanding the human-readable instructions that define it. To study this, we introduce NATURAL INSTRUCTIONS, a dataset of 61 distinct tasks, their human-authored instructions, and 193k task instances (input-output pairs). The instructions are obtained from crowdsourcing instructions used to create existing NLP datasets and mapped to a unified schema. Using this meta-dataset, we measure cross-task generalization by training models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Topic Modeling · Multimodal Machine Learning Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Layer Normalization · Residual Connection · Weight Decay · Byte Pair Encoding · Dropout
