Symbol tuning improves in-context learning in language models

Jerry Wei; Le Hou; Andrew Lampinen; Xiangning Chen; Da; Huang; Yi Tay; Xinyun Chen; Yifeng Lu; Denny Zhou; Tengyu Ma; and Quoc V. Le

arXiv:2305.08298·cs.CL·January 2, 2024·1 cites

Symbol tuning improves in-context learning in language models

Jerry Wei, Le Hou, Andrew Lampinen, Xiangning Chen, Da, Huang, Yi Tay, Xinyun Chen, Yifeng Lu, Denny Zhou, Tengyu Ma, and Quoc V. Le

PDF

Open Access 2 Datasets

TL;DR

Symbol tuning, which replaces natural language labels with arbitrary symbols during finetuning, enhances language models' in-context learning, robustness, and reasoning abilities, especially on unseen tasks and when handling label variations.

Contribution

The paper introduces symbol tuning as a novel finetuning method that improves in-context learning and reasoning in large language models.

Findings

01

Boosts performance on unseen in-context tasks

02

Enhances robustness to underspecified prompts

03

Improves ability to follow flipped labels in-context

Abstract

We present symbol tuning - finetuning language models on in-context input-label pairs where natural language labels (e.g., "positive/negative sentiment") are replaced with arbitrary symbols (e.g., "foo/bar"). Symbol tuning leverages the intuition that when a model cannot use instructions or natural language labels to figure out a task, it must instead do so by learning the input-label mappings. We experiment with symbol tuning across Flan-PaLM models up to 540B parameters and observe benefits across various settings. First, symbol tuning boosts performance on unseen in-context learning tasks and is much more robust to underspecified prompts, such as those without instructions or without natural language labels. Second, symbol-tuned models are much stronger at algorithmic reasoning tasks, with up to 18.2% better performance on the List Functions benchmark and up to 15.3% better…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification