UniSD: Towards a Unified Self-Distillation Framework for Large Language Models

Yiqiao Jin; Yiyang Wang; Lucheng Fu; Yijia Xiao; Yinyi Luo; Haoxin Liu; B. Aditya Prakash; Josiah Hester; Jindong Wang; Srijan Kumar

arXiv:2605.06597·cs.CL·May 22, 2026

UniSD: Towards a Unified Self-Distillation Framework for Large Language Models

Yiqiao Jin, Yiyang Wang, Lucheng Fu, Yijia Xiao, Yinyi Luo, Haoxin Liu, B. Aditya Prakash, Josiah Hester, Jindong Wang, Srijan Kumar

PDF

1 Repo

TL;DR

UniSD introduces a comprehensive self-distillation framework for large language models, systematically studying and combining multiple mechanisms to enhance model adaptation and performance without external teachers.

Contribution

The paper proposes UniSD, a unified framework that integrates various self-distillation techniques, providing new insights and achieving state-of-the-art results across multiple benchmarks.

Findings

01

Self-distillation improves model performance in certain tasks.

02

Component interactions influence the effectiveness of self-distillation.

03

The integrated UniSDfull pipeline outperforms baseline models significantly.

Abstract

Self-distillation (SD) offers a promising path for adapting large language models (LLMs) without relying on stronger external teachers. However, SD in autoregressive LLMs remains challenging because self-generated trajectories are free-form, correctness is task-dependent, and plausible rationales can still provide unstable or unreliable supervision. Existing methods mainly examine isolated design choices, leaving their effectiveness, roles, and interactions unclear. In this paper, we propose UniSD, a unified framework to systematically study self-distillation. UniSD integrates complementary mechanisms that address supervision reliability, representation alignment, and training stability, including multi-teacher agreement, EMA teacher stabilization, token-level contrastive learning, feature matching, and divergence clipping. Across six benchmarks and six models from three model families,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ahren09/UniSD
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.