Less is More: Selective Reflection for Compatible and Efficient Knowledge Distillation in Large Language Models

Lingyuan Liu; Mengxiang Zhang

arXiv:2508.06135·cs.CL·August 11, 2025

Less is More: Selective Reflection for Compatible and Efficient Knowledge Distillation in Large Language Models

Lingyuan Liu, Mengxiang Zhang

PDF

Open Access

TL;DR

This paper introduces Selective Reflection Distillation (SRD), a data curation framework that improves large language model distillation by selecting high-quality, compatible training data, leading to better performance and reduced training costs.

Contribution

SRD is a novel, plug-and-play data curation method that enhances knowledge distillation by systematically selecting and scheduling training data based on model reflections and difficulty.

Findings

01

SRD improves distilled model performance across various benchmarks.

02

SRD reduces training runtime by up to 39%.

03

SRD enhances sample efficiency without altering existing KD algorithms.

Abstract

Knowledge Distillation (KD) is a fundamental technique for compressing large language models (LLMs) into compact, efficient student models. However, existing white-box KD methods mainly focus on balancing ground truth and student-generated responses while overlooking two critical factors: training data quality and student-model compatibility. To address these limitations, we propose Selective Reflection Distillation (SRD), a novel data curation framework that leverages reflections from student models to systematically refine training data. SRD dynamically evaluates and selects prompt-response pairs by comparing ground truth data with student model outputs, selectively curating high-quality, student-compatible training instances through automated ranking based on difficulty. Furthermore, after selecting the training data, a curriculum scheduling strategy is employed to incrementally…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques