An Incomplete Loop: Instruction Inference, Instruction Following, and   In-context Learning in Language Models

Emmy Liu; Graham Neubig; Jacob Andreas

arXiv:2404.03028·cs.CL·August 30, 2024·3 cites

An Incomplete Loop: Instruction Inference, Instruction Following, and In-context Learning in Language Models

Emmy Liu, Graham Neubig, Jacob Andreas

PDF

Open Access

TL;DR

This paper investigates how language models perform instruction inference, instruction following, and in-context learning, revealing their distinct reasoning capabilities and limitations across different tasks and models.

Contribution

It provides a comparative analysis of different reasoning types in LMs, highlighting the non-systematic nature of their learning mechanisms and dissociations between inference and learning.

Findings

01

LMs can learn from few-shot prompts without explaining their rules

02

They can infer task descriptions but fail to learn from human descriptions

03

Different reasoning capabilities are invoked by similar prompting procedures

Abstract

Modern language models (LMs) can learn to perform new tasks in different ways: in instruction following, the target task is described explicitly in natural language; in few-shot prompting, the task is specified implicitly with a small number of examples; in instruction inference, LMs are presented with in-context examples and are then prompted to generate a natural language task description before making predictions. Each of these procedures may be thought of as invoking a different form of reasoning: instruction following involves deductive reasoning, few-shot prompting involves inductive reasoning, and instruction inference involves abductive reasoning. How do these different capabilities relate? Across four LMs (from the gpt and llama families) and two learning problems (involving arithmetic functions and machine translation) we find a strong dissociation between the different types…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Residual Connection · Dropout · Layer Normalization · Multi-Head Attention · Weight Decay · Adam · Cosine Annealing · Byte Pair Encoding