TL;DR
UniSD introduces a comprehensive self-distillation framework for large language models, systematically studying and combining multiple mechanisms to enhance model adaptation and performance without external teachers.
Contribution
The paper proposes UniSD, a unified framework that integrates various self-distillation techniques, providing new insights and achieving state-of-the-art results across multiple benchmarks.
Findings
Self-distillation improves model performance in certain tasks.
Component interactions influence the effectiveness of self-distillation.
The integrated UniSDfull pipeline outperforms baseline models significantly.
Abstract
Self-distillation (SD) offers a promising path for adapting large language models (LLMs) without relying on stronger external teachers. However, SD in autoregressive LLMs remains challenging because self-generated trajectories are free-form, correctness is task-dependent, and plausible rationales can still provide unstable or unreliable supervision. Existing methods mainly examine isolated design choices, leaving their effectiveness, roles, and interactions unclear. In this paper, we propose UniSD, a unified framework to systematically study self-distillation. UniSD integrates complementary mechanisms that address supervision reliability, representation alignment, and training stability, including multi-teacher agreement, EMA teacher stabilization, token-level contrastive learning, feature matching, and divergence clipping. Across six benchmarks and six models from three model families,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
