Small Models Struggle to Learn from Strong Reasoners

Yuetai Li; Xiang Yue; Zhangchen Xu; Fengqing Jiang; Luyao Niu; Bill Yuchen Lin; Bhaskar Ramasubramanian; Radha Poovendran

arXiv:2502.12143·cs.AI·November 14, 2025

Small Models Struggle to Learn from Strong Reasoners

Yuetai Li, Xiang Yue, Zhangchen Xu, Fengqing Jiang, Luyao Niu, Bill Yuchen Lin, Bhaskar Ramasubramanian, Radha Poovendran

PDF

Open Access 1 Repo 1 Models 5 Datasets 1 Video

TL;DR

This paper investigates why small language models struggle with complex reasoning and introduces Mix Distillation, a method that combines reasoning examples of varying complexity to enhance small model performance.

Contribution

The paper identifies the Small Model Learnability Gap and proposes Mix Distillation to improve small model reasoning by balancing reasoning complexity during training.

Findings

01

Mix Distillation improves small model reasoning accuracy.

02

Small models perform better with shorter, simpler reasoning chains.

03

Direct distillation from large models is less effective for small models.

Abstract

Large language models (LLMs) excel in complex reasoning tasks, and distilling their reasoning capabilities into smaller models has shown promise. However, we uncover an interesting phenomenon, which we term the Small Model Learnability Gap: small models ( $\leq$ 3B parameters) do not consistently benefit from long chain-of-thought (CoT) reasoning or distillation from larger models. Instead, they perform better when fine-tuned on shorter, simpler reasoning chains that better align with their intrinsic learning capacity. To address this, we propose Mix Distillation, a simple yet effective strategy that balances reasoning complexity by combining long and short CoT examples or reasoning from both larger and smaller models. Our experiments demonstrate that Mix Distillation significantly improves small model reasoning performance compared to training on either data alone. These findings…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Small-Model-Gap/Small-Model-Learnability-Gap
pytorch

Models

🤗
quwsarohi/SmolThink
model· 1 dl· ♡ 1
1 dl♡ 1

Datasets

Videos

Small Models Struggle to Learn from Strong Reasoners· underline

Taxonomy

TopicsComplex Systems and Decision Making · Statistics Education and Methodologies · Reservoir Engineering and Simulation Methods

MethodsALIGN