The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
Shayne Longpre, Le Hou, Tu Vu, Albert Webson, Hyung Won Chung, Yi Tay,, Denny Zhou, Quoc V. Le, Barret Zoph, Jason Wei, Adam Roberts

TL;DR
This paper analyzes the design choices behind instruction tuning methods, especially Flan 2022, highlighting the importance of task balancing, mixed prompt training, and demonstrating that instruction-tuned models like Flan-T5 are more efficient and effective for downstream tasks.
Contribution
The paper provides a detailed ablation study of Flan 2022's design decisions, revealing critical factors for successful instruction tuning and offering a publicly available dataset collection.
Findings
Task balancing and enrichment are crucial for effective instruction tuning.
Training with mixed prompt settings improves performance across evaluation scenarios.
Flan-T5 requires less finetuning and converges faster than T5 on downstream tasks.
Abstract
We study the design decisions of publicly available instruction tuning methods, and break down the development of Flan 2022 (Chung et al., 2022). Through careful ablation studies on the Flan Collection of tasks and methods, we tease apart the effect of design decisions which enable Flan-T5 to outperform prior work by 3-17%+ across evaluation settings. We find task balancing and enrichment techniques are overlooked but critical to effective instruction tuning, and in particular, training with mixed prompt settings (zero-shot, few-shot, and chain-of-thought) actually yields stronger (2%+) performance in all settings. In further experiments, we show Flan-T5 requires less finetuning to converge higher and faster than T5 on single downstream tasks, motivating instruction-tuned models as more computationally-efficient starting checkpoints for new tasks. Finally, to accelerate research on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗allenai/open-instruct-flan-v2-7bmodel· 22 dl· ♡ 122 dl♡ 1
- 🤗allenai/open-instruct-cot-7bmodel· 24 dl· ♡ 124 dl♡ 1
- 🤗allenai/tulu-7bmodel· 43 dl· ♡ 943 dl♡ 9
- 🤗allenai/open-instruct-human-mix-7bmodel· 28 dl28 dl
- 🤗allenai/open-instruct-cot-13bmodel· 22 dl22 dl
- 🤗allenai/open-instruct-flan-v2-13bmodel· 13 dl13 dl
- 🤗allenai/open-instruct-human-mix-30bmodel· 18 dl· ♡ 118 dl♡ 1
- 🤗allenai/tulu-30bmodel· 41 dl· ♡ 1841 dl♡ 18
- 🤗allenai/open-instruct-human-mix-65bmodel· 897 dl· ♡ 4897 dl♡ 4
- 🤗allenai/tulu-65bmodel· 37 dl· ♡ 2137 dl♡ 21
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Ferroelectric and Negative Capacitance Devices · Machine Learning and Data Classification
MethodsAttention Is All You Need · Flan-T5 · Linear Layer · Byte Pair Encoding · Multi-Head Attention · Residual Connection · Dense Connections · Refunds@Expedia|||How do I get a full refund from Expedia? · Dropout · Softmax
