Decoupling Understanding from Reasoning via Problem Space Mapping for Small-Scale Model Reasoning
Li Wang, Changhao Zhang, Zengqi Xiu, Kai Lu, Xin Yu, Kui Zhang, Wenjun Wu

TL;DR
This paper introduces a novel framework that decouples understanding from reasoning in small language models by mapping problems into a simplified canonical space, significantly enhancing their reasoning performance and robustness.
Contribution
The paper proposes DURIT, a three-step iterative training algorithm that maps natural language problems into a canonical space, improving small models' reasoning abilities and robustness.
Findings
DURIT substantially improves reasoning accuracy on mathematical and logical tasks.
The framework enhances model robustness across in-domain and out-of-domain data.
Decoupling understanding from reasoning effectively strengthens small language models.
Abstract
Despite recent advances in the reasoning capabilities of Large Language Models (LLMs), improving the reasoning ability of Small Language Models (SLMs, e.g., up to 1.5B parameters) remains challenging. A key obstacle lies in the complexity and variability of natural language: essentially equivalent problems often appear in diverse surface forms, often obscured by redundant or distracting details. This imposes a dual burden on SLMs: they must first extract the core problem from complex linguistic input, and then perform reasoning based on that understanding. The resulting vast and noisy problem space hinders optimization, particularly for models with limited capacity. To address this, we propose a new framework that decouples understanding from reasoning by mapping natural language problems into a canonical problem space-a semantically simplified yet expressive domain. This enables SLMs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
