MIND: From Passive Mimicry to Active Reasoning through Capability-Aware Multi-Perspective CoT Distillation

Jin Cui; Jiaqi Guo; Jiepeng Zhou; Ruixuan Yang; Jiayi Lu; Jiajun Xu; Jiangcheng Song; Boran Zhao; Pengju Ren

arXiv:2601.03717·cs.CL·January 8, 2026

MIND: From Passive Mimicry to Active Reasoning through Capability-Aware Multi-Perspective CoT Distillation

Jin Cui, Jiaqi Guo, Jiepeng Zhou, Ruixuan Yang, Jiayi Lu, Jiajun Xu, Jiangcheng Song, Boran Zhao, Pengju Ren

PDF

Open Access

TL;DR

MIND introduces a capability-aware distillation framework that synthesizes diverse reasoning perspectives and dynamically aligns supervision to improve small model reasoning across various domains.

Contribution

It presents a novel framework with a Teaching Assistant network and feedback-driven calibration to enhance reasoning transfer and generalization in smaller models.

Findings

01

Achieves state-of-the-art results on multiple benchmarks.

02

Effectively mitigates catastrophic forgetting during training.

03

Enhances reasoning ability internalization in small models.

Abstract

While Large Language Models (LLMs) have emerged with remarkable capabilities in complex tasks through Chain-of-Thought reasoning, practical resource constraints have sparked interest in transferring these abilities to smaller models. However, achieving both domain performance and cross-domain generalization remains challenging. Existing approaches typically restrict students to following a single golden rationale and treat different reasoning paths independently. Due to distinct inductive biases and intrinsic preferences, alongside the student's evolving capacity and reasoning preferences during training, a teacher's "optimal" rationale could act as out-of-distribution noise. This misalignment leads to a degeneration of the student's latent reasoning distribution, causing suboptimal performance. To bridge this gap, we propose MIND, a capability-adaptive framework that transitions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Topic Modeling · Multimodal Machine Learning Applications