CodeReasoner: Enhancing the Code Reasoning Ability with Reinforcement Learning

Lingxiao Tang; He Ye; Zhongxin Liu; Xiaoxue Ren; Lingfeng Bao

arXiv:2507.17548·cs.SE·July 24, 2025

CodeReasoner: Enhancing the Code Reasoning Ability with Reinforcement Learning

Lingxiao Tang, He Ye, Zhongxin Liu, Xiaoxue Ren, Lingfeng Bao

PDF

Open Access

TL;DR

CodeReasoner enhances code reasoning in large language models by combining dataset construction, instruction tuning, and reinforcement learning, leading to significant performance improvements over prior methods.

Contribution

It introduces a novel framework integrating dataset creation, instruction tuning, and reinforcement learning to improve code reasoning and generalization in LLMs.

Findings

01

Improves code reasoning benchmark performance by up to 40.2%.

02

7B model matches GPT-4o on key tasks.

03

Scaling to 14B surpasses GPT-4o across benchmarks.

Abstract

Code reasoning is a fundamental capability for large language models (LLMs) in the code domain. It involves understanding and predicting a program's execution behavior, such as determining the output for a given input or whether a specific statement will be executed. This capability is essential for downstream tasks like debugging, code generation, and program repair. Prior approaches mainly rely on supervised fine-tuning to improve performance in code reasoning tasks. However, they often show limited gains and fail to generalize across diverse scenarios. We argue this is due to two core issues: the low quality of training data and the limitations of supervised fine-tuning, which struggles to teach general reasoning skills. To address these challenges, we propose CodeReasoner, a framework that spans both dataset construction and a two-stage training process. First, we introduce a method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Software Engineering Techniques and Practices