OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization
Srinivasan Iyer, Xi Victoria Lin, Ramakanth Pasunuru, Todor, Mihaylov, Daniel Simig, Ping Yu, Kurt Shuster, Tianlu Wang and, Qing Liu, Punit Singh Koura, Xian Li, Brian O'Horo, Gabriel, Pereyra, Jeff Wang, Christopher Dewan, Asli Celikyilmaz, Luke, Zettlemoyer, Ves Stoyanov

TL;DR
This paper investigates how different instruction-tuning choices affect large language models' ability to generalize to new tasks, introducing a comprehensive benchmark and instruction-tuned models that outperform baseline models across diverse NLP tasks.
Contribution
It introduces OPT-IML, a large instruction-tuning benchmark and models, providing new insights into instruction-tuning decisions and their impact on model generalization.
Findings
OPT-IML models outperform baseline OPT models on multiple benchmarks.
Instruction-tuning decisions significantly influence generalization capabilities.
Large-scale instruction-tuning enhances zero and few-shot learning across diverse tasks.
Abstract
Recent work has shown that fine-tuning large pre-trained language models on a collection of tasks described via instructions, a.k.a. instruction-tuning, improves their zero and few-shot generalization to unseen tasks. However, there is a limited understanding of the performance trade-offs of different decisions made during the instruction-tuning process. These decisions include the scale and diversity of the instruction-tuning benchmark, different task sampling strategies, fine-tuning with and without demonstrations, training using specialized datasets for reasoning and dialogue, and finally, the fine-tuning objectives themselves. In this paper, we characterize the effect of instruction-tuning decisions on downstream task performance when scaling both model and benchmark sizes. To this end, we create OPT-IML Bench: a large benchmark for Instruction Meta-Learning (IML) of 2000 NLP tasks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsOPT-IML · OPT
