Less is More: Selective Reflection for Compatible and Efficient Knowledge Distillation in Large Language Models
Lingyuan Liu, Mengxiang Zhang

TL;DR
This paper introduces Selective Reflection Distillation (SRD), a data curation framework that improves large language model distillation by selecting high-quality, compatible training data, leading to better performance and reduced training costs.
Contribution
SRD is a novel, plug-and-play data curation method that enhances knowledge distillation by systematically selecting and scheduling training data based on model reflections and difficulty.
Findings
SRD improves distilled model performance across various benchmarks.
SRD reduces training runtime by up to 39%.
SRD enhances sample efficiency without altering existing KD algorithms.
Abstract
Knowledge Distillation (KD) is a fundamental technique for compressing large language models (LLMs) into compact, efficient student models. However, existing white-box KD methods mainly focus on balancing ground truth and student-generated responses while overlooking two critical factors: training data quality and student-model compatibility. To address these limitations, we propose Selective Reflection Distillation (SRD), a novel data curation framework that leverages reflections from student models to systematically refine training data. SRD dynamically evaluates and selects prompt-response pairs by comparing ground truth data with student model outputs, selectively curating high-quality, student-compatible training instances through automated ranking based on difficulty. Furthermore, after selecting the training data, a curriculum scheduling strategy is employed to incrementally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
