What Really Improves Mathematical Reasoning: Structured Reasoning Signals Beyond Pure Code

Yuze Zhao; Junpeng Fang; Lu Yu; Zhenya Huang; Kai Zhang; Qing Cui; Qi Liu; Jun Zhou; Enhong Chen

arXiv:2605.19762 · cs.AI · May 20, 2026

TL;DR

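This paper examines whether code itself improves mathematical reasoning in language models. In controlled pretraining experiments, standalone executable code improves programming ability but does not act as a general reasoning enhancer; the gains often attributed to code are better explained by structured reasoning traces such as code-text and math-text mixtures.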

Contribution

The paper shows that structured reasoning signals, rather than executable code alone, account for gains in mathematical reasoning, and it offers data-centric strategies for optimizing model capabilities.

Findings

01

When restricted to standalone executable programs, code improves programming ability but does not improve general reasoning.

02

Structured math-text samples significantly boost mathematical reasoning.

03

Data composition influences model performance and capability transfer.

Abstract

Code has become a standard component of modern foundation language model (LM) training, yet its role beyond programming remains unclear. We revisit the claim that code improves reasoning through controlled pretraining experiments on a 10T-token corpus with fine-grained domain separation. Our findings are threefold. First, when code is restricted to standalone executable programs and Code-NL data are controlled for, code substantially improves programming ability but does not act as a general reasoning enhancer; instead, it competes with knowledge-intensive tasks, especially complex mathematical reasoning. Second, the reasoning gains often attributed to code are better explained by cross-domain structured reasoning traces, such as code-text and math-text mixtures, rather than by executable code alone. Third, increasing the density of structured math-domain samples within a fixed math…
