Emergent Abilities of Large Language Models

Jason Wei; Yi Tay; Rishi Bommasani; Colin Raffel; Barret Zoph,; Sebastian Borgeaud; Dani Yogatama; Maarten Bosma; Denny Zhou; Donald Metzler,; Ed H. Chi; Tatsunori Hashimoto; Oriol Vinyals; Percy Liang; Jeff Dean,; William Fedus

arXiv:2206.07682·cs.CL·October 27, 2022·1.0k cites

Emergent Abilities of Large Language Models

Jason Wei, Yi Tay, Rishi Bommasani, Colin Raffel, Barret Zoph,, Sebastian Borgeaud, Dani Yogatama, Maarten Bosma, Denny Zhou, Donald Metzler,, Ed H. Chi, Tatsunori Hashimoto, Oriol Vinyals, Percy Liang, Jeff Dean,, William Fedus

PDF

Open Access 2 Videos

TL;DR

This paper explores the unpredictable emergence of new abilities in large language models as they scale, which are not present in smaller models and cannot be predicted by performance extrapolation.

Contribution

It introduces the concept of emergent abilities in large language models, highlighting their unpredictable nature and implications for future scaling.

Findings

01

Emergent abilities appear only in larger models.

02

Emergence cannot be predicted from smaller models' performance.

03

Scaling can further expand language model capabilities.

Abstract

Scaling up language models has been shown to predictably improve performance and sample efficiency on a wide range of downstream tasks. This paper instead discusses an unpredictable phenomenon that we refer to as emergent abilities of large language models. We consider an ability to be emergent if it is not present in smaller models but is present in larger models. Thus, emergent abilities cannot be predicted simply by extrapolating the performance of smaller models. The existence of such emergence implies that additional scaling could further expand the range of capabilities of language models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

The Ghost in the Machine — Noam Chomsky· youtube

‘We Must Slow Down the Race’ – X AI, GPT 4 Can Now Do Science and Altman GPT 5 Statement· youtube

Taxonomy

TopicsTopic Modeling · Machine Learning and Data Classification · Machine Learning and Algorithms