Instruction Tuning with Human Curriculum
Bruce W. Lee, Hyunsoo Cho, Kang Min Yoo

TL;DR
This paper introduces Curriculum Instruction Tuning, a method that organizes instruction data by difficulty to improve model performance across multiple benchmarks without extra computational cost.
Contribution
It presents a systematic curriculum-based data generation pipeline and demonstrates significant performance gains through curriculum ordering in instruction tuning.
Findings
Performance improvements of +4.76 on TruthfulQA and +2.98 on MMLU
Consistent benefits across nine benchmarks
No additional computational expenses required
Abstract
In this work, we (1) introduce Curriculum Instruction Tuning, (2) explore the potential advantages of employing diverse curriculum strategies, and (3) delineate a synthetic instruction-response generation framework that complements our theoretical approach. Distinct from the existing instruction tuning dataset, our generation pipeline is systematically structured to emulate the sequential and orderly characteristic of human learning. Additionally, we describe a methodology for generating instruction-response datasets that extensively span the various stages of human education, from middle school through the graduate level, utilizing educational subject catalogs. Before training, we meticulously organize the instruction data to ensure that questions escalate in difficulty regarding (A) the subject matter and (B) the intricacy of the instructions. The findings of our study reveal that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsOnline Learning and Analytics · Machine Learning and Data Classification · Educational Assessment and Pedagogy
MethodsMulti-Head Attention · Attention Is All You Need · Dense Connections · Linear Layer · Softmax · Residual Connection · Absolute Position Encodings · Layer Normalization · Adam · Byte Pair Encoding
