Instruction Induction: From Few Examples to Natural Language Task   Descriptions

Or Honovich; Uri Shaham; Samuel R. Bowman; Omer Levy

arXiv:2205.10782·cs.CL·May 24, 2022·6 cites

Instruction Induction: From Few Examples to Natural Language Task Descriptions

Or Honovich, Uri Shaham, Samuel R. Bowman, Omer Levy

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that large language models can infer natural language task descriptions from few examples, enabling explicit instruction induction which improves task understanding and performance.

Contribution

The paper introduces the instruction induction challenge, a new dataset, and an evaluation metric, showing that instruction generation ability emerges in sufficiently large and aligned models like InstructGPT.

Findings

01

InstructGPT achieves 65.7% of human performance in instruction execution.

02

GPT-3 achieves only 9.8% of human performance.

03

Instruction induction emerges in large, instruction-following models.

Abstract

Large language models are able to perform a task by conditioning on a few input-output demonstrations - a paradigm known as in-context learning. We show that language models can explicitly infer an underlying task from a few demonstrations by prompting them to generate a natural language instruction that fits the examples. To explore this ability, we introduce the instruction induction challenge, compile a dataset consisting of 24 tasks, and define a novel evaluation metric based on executing the generated instruction. We discover that, to a large extent, the ability to generate instructions does indeed emerge when using a model that is both large enough and aligned to follow instructions; InstructGPT achieves 65.7% of human performance in our execution-based metric, while the original GPT-3 model reaches only 9.8% of human performance. This surprising result suggests that instruction…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

orhonovich/instruction-induction
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Machine Learning and Data Classification · Explainable Artificial Intelligence (XAI)

Methods15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Linear Layer · Weight Decay · Byte Pair Encoding · Dense Connections · {Dispute@FaQ-s}How to file a dispute with Expedia? · Dropout · Cosine Annealing · Attention Dropout