OPT-IML: Scaling Language Model Instruction Meta Learning through the   Lens of Generalization

Srinivasan Iyer; Xi Victoria Lin; Ramakanth Pasunuru; Todor; Mihaylov; Daniel Simig; Ping Yu; Kurt Shuster; Tianlu Wang and; Qing Liu; Punit Singh Koura; Xian Li; Brian O'Horo; Gabriel; Pereyra; Jeff Wang; Christopher Dewan; Asli Celikyilmaz; Luke; Zettlemoyer; Ves Stoyanov

arXiv:2212.12017·cs.CL·January 31, 2023·85 cites

OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization

Srinivasan Iyer, Xi Victoria Lin, Ramakanth Pasunuru, Todor, Mihaylov, Daniel Simig, Ping Yu, Kurt Shuster, Tianlu Wang and, Qing Liu, Punit Singh Koura, Xian Li, Brian O'Horo, Gabriel, Pereyra, Jeff Wang, Christopher Dewan, Asli Celikyilmaz, Luke, Zettlemoyer, Ves Stoyanov

PDF

Open Access 1 Repo 5 Models

TL;DR

This paper investigates how different instruction-tuning choices affect large language models' ability to generalize to new tasks, introducing a comprehensive benchmark and instruction-tuned models that outperform baseline models across diverse NLP tasks.

Contribution

It introduces OPT-IML, a large instruction-tuning benchmark and models, providing new insights into instruction-tuning decisions and their impact on model generalization.

Findings

01

OPT-IML models outperform baseline OPT models on multiple benchmarks.

02

Instruction-tuning decisions significantly influence generalization capabilities.

03

Large-scale instruction-tuning enhances zero and few-shot learning across diverse tasks.

Abstract

Recent work has shown that fine-tuning large pre-trained language models on a collection of tasks described via instructions, a.k.a. instruction-tuning, improves their zero and few-shot generalization to unseen tasks. However, there is a limited understanding of the performance trade-offs of different decisions made during the instruction-tuning process. These decisions include the scale and diversity of the instruction-tuning benchmark, different task sampling strategies, fine-tuning with and without demonstrations, training using specialized datasets for reasoning and dialogue, and finally, the fine-tuning objectives themselves. In this paper, we characterize the effect of instruction-tuning decisions on downstream task performance when scaling both model and benchmark sizes. To this end, we create OPT-IML Bench: a large benchmark for Instruction Meta-Learning (IML) of 2000 NLP tasks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tanyuqian/cappy
jax

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsOPT-IML · OPT