YODA: Teacher-Student Progressive Learning for Language Models
Jianqiao Lu, Wanjun Zhong, Yufei Wang, Zhijiang Guo, Qi Zhu, Wenyong, Huang, Yanlin Wang, Fei Mi, Baojun Wang, Yasheng Wang, Lifeng Shang, Xin, Jiang, Qun Liu

TL;DR
YODA introduces a teacher-student progressive learning framework for language models that mimics human educational processes, leading to significant performance improvements in math reasoning tasks.
Contribution
This paper presents a novel progressive learning framework that organizes model training through a human-inspired basic-generalized-harder curriculum, enhancing fine-tuning efficacy.
Findings
Training LLaMA2 with YODA data improves GSM8K accuracy by 17.01%.
Training with curriculum learning further boosts robustness.
YODA's approach outperforms traditional fine-tuning methods.
Abstract
Although large language models (LLMs) have demonstrated adeptness in a range of tasks, they still lag behind human learning efficiency. This disparity is often linked to the inherent human capacity to learn from basic examples, gradually generalize and handle more complex problems, and refine their skills with continuous feedback. Inspired by this, this paper introduces YODA, a novel teacher-student progressive learning framework that emulates the teacher-student education process to improve the efficacy of model fine-tuning. The framework operates on an interactive \textit{basic-generalized-harder} loop. The teacher agent provides tailored feedback on the student's answers, and systematically organizes the education process. This process unfolds by teaching the student basic examples, reinforcing understanding through generalized questions, and then enhancing learning by posing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
MethodsShrink and Fine-Tune
