H\'an D\=an Xu\'e B\`u (Mimicry) or Q\=ing Ch\=u Y\'u L\'an (Mastery)? A Cognitive Perspective on Reasoning Distillation in Large Language Models

Yueqing Hu; Xinyang Peng; Shuting Peng; Hanqi Wang; Tianhong Wang

arXiv:2601.05019·cs.CL·April 24, 2026

H\'an D\=an Xu\'e B\`u (Mimicry) or Q\=ing Ch\=u Y\'u L\'an (Mastery)? A Cognitive Perspective on Reasoning Distillation in Large Language Models

Yueqing Hu, Xinyang Peng, Shuting Peng, Hanqi Wang, Tianhong Wang

PDF

TL;DR

This paper investigates how current reasoning distillation methods for large language models fail to transfer human-like cognitive structures, leading to superficial mimicry and negative transfer effects.

Contribution

It reveals that supervised fine-tuning causes a collapse in cognitive alignment, emphasizing the importance of reinforcement learning for genuine reasoning capabilities.

Findings

01

Distillation reduces alignment with human difficulty scaling from 0.64 to 0.34

02

Students often underperform their pre-distillation baselines

03

Reasoning distillation decouples computational cost from cognitive demand

Abstract

Recent Large Reasoning Models trained via reinforcement learning exhibit a "natural" alignment with human cognitive costs. However, we show that the prevailing paradigm of reasoning distillation -- training student models to mimic these traces via Supervised Fine-Tuning (SFT) -- fails to transmit this cognitive structure. Testing the "H\'an D\=an Xu\'e B\`u" (Superficial Mimicry) hypothesis across 14 models, we find that distillation induces a "Functional Alignment Collapse": while teacher models mirror human difficulty scaling ( $\overset{r}{ˉ} = 0.64$ ), distilled students significantly degrade this alignment ( $\overset{r}{ˉ} = 0.34$ ), often underperforming their own pre-distillation baselines ("Negative Transfer"). Our analysis suggests that SFT induces a "Cargo Cult" effect, where students ritualistically replicate the linguistic form of reasoning (verbosity) without internalizing the teacher's dynamic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.