HintMR: Eliciting Stronger Mathematical Reasoning in Small Language Models
Jawad Hossain, Xiangyu Guo, Jiawei Zhou, Chong Liu

TL;DR
HintMR introduces a cooperative two-model framework where a hint-generating small language model guides a reasoning model through multi-step mathematical problems, significantly improving accuracy.
Contribution
This work presents a novel hint-assisted reasoning framework that decomposes solutions into steps with context-aware hints generated by a separate model, enhancing small language models' mathematical reasoning.
Findings
Hint assistance improves reasoning accuracy across benchmarks.
The cooperative system reduces error propagation in multi-step reasoning.
Structured hint generation enhances small language models' problem-solving capabilities.
Abstract
Small language models (SLMs) often struggle with complex mathematical reasoning due to limited capacity to maintain long chains of intermediate steps and to recover from early errors. We address this challenge by introducing a hint-assisted reasoning framework that incrementally guides SLMs through multi-step mathematical problem solving. Our approach decomposes solutions into sequential reasoning steps and provides context-aware hints, where hints are generated by a separate SLM trained via distillation from a strong large language model. While the hint-generating SLM alone is not capable of solving the problems, its collaboration with a reasoning SLM enables effective guidance, forming a cooperative two-model system for reasoning. Each hint is generated conditionally on the problem statement and the accumulated reasoning history, providing stepwise, localized guidance without…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
