Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding

Taewon Yun; Jisu Shin; Jeonghwan Choi; Seunghwan Bang; Hwanjun Song

arXiv:2605.02290·cs.AI·May 5, 2026

Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding

Taewon Yun, Jisu Shin, Jeonghwan Choi, Seunghwan Bang, Hwanjun Song

PDF

1 Repo

TL;DR

The paper presents CoRD, a collaborative multi-teacher decoding method for efficient, high-quality reasoning data distillation in Long-CoT models, outperforming existing approaches.

Contribution

Introducing CoRD, a step-wise, collaborative decoding framework that leverages heterogeneous teachers and dynamic exploration for better reasoning data distillation.

Findings

01

CoRD produces higher-quality reasoning data.

02

Achieves near teacher-level student performance with fewer supervision signals.

03

Generalizes well to out-of-domain and open-ended tasks.

Abstract

Distilling large reasoning models is essential for making Long-CoT reasoning practical, as full-scale inference remains computationally prohibitive. Existing curation-based approaches select complete reasoning traces post-hoc, overlooking collaboration among heterogeneous teachers and lacking dynamic exploration, which leads to redundant sampling and missed complementary reasoning. We introduce CoRD, a collaborative multi-teacher decoding framework that performs step-wise reasoning synthesis guided by predictive perplexity-based scoring and beam search. This enables heterogeneous LRMs to jointly construct coherent reasoning trajectories while efficiently preserving diverse, high-potential hypotheses. Experiments show that CoRD produces higher-quality reasoning data and achieves near teacher-level student performance with fewer, structured supervision signals, without substantial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

DISL-Lab/CoRD
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.