An Incomplete Loop: Instruction Inference, Instruction Following, and In-context Learning in Language Models
Emmy Liu, Graham Neubig, Jacob Andreas

TL;DR
This paper investigates how language models perform instruction inference, instruction following, and in-context learning, revealing their distinct reasoning capabilities and limitations across different tasks and models.
Contribution
It provides a comparative analysis of different reasoning types in LMs, highlighting the non-systematic nature of their learning mechanisms and dissociations between inference and learning.
Findings
LMs can learn from few-shot prompts without explaining their rules
They can infer task descriptions but fail to learn from human descriptions
Different reasoning capabilities are invoked by similar prompting procedures
Abstract
Modern language models (LMs) can learn to perform new tasks in different ways: in instruction following, the target task is described explicitly in natural language; in few-shot prompting, the task is specified implicitly with a small number of examples; in instruction inference, LMs are presented with in-context examples and are then prompted to generate a natural language task description before making predictions. Each of these procedures may be thought of as invoking a different form of reasoning: instruction following involves deductive reasoning, few-shot prompting involves inductive reasoning, and instruction inference involves abductive reasoning. How do these different capabilities relate? Across four LMs (from the gpt and llama families) and two learning problems (involving arithmetic functions and machine translation) we find a strong dissociation between the different types…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Residual Connection · Dropout · Layer Normalization · Multi-Head Attention · Weight Decay · Adam · Cosine Annealing · Byte Pair Encoding
