Boosting Open-Source LLMs for Program Repair via Reasoning Transfer and LLM-Guided Reinforcement Learning

Xunzhu Tang; Jacques Klein; Tegawend\'e F. Bissyand\'e

arXiv:2506.03921·cs.SE·June 5, 2025

Boosting Open-Source LLMs for Program Repair via Reasoning Transfer and LLM-Guided Reinforcement Learning

Xunzhu Tang, Jacques Klein, Tegawend\'e F. Bissyand\'e

PDF

Open Access

TL;DR

This paper presents Repairity, a three-stage method that enhances open-source LLMs for program repair by extracting reasoning, transferring knowledge, and applying reinforcement learning, significantly narrowing the performance gap with closed-source models.

Contribution

Introduction of Repairity, a novel approach combining reasoning transfer and reinforcement learning to improve open-source LLMs for program repair.

Findings

01

Open-source LLMs improved by 8.68% on average.

02

Performance gap reduced from 10.05% to 1.35%.

03

Both reasoning extraction and reinforcement learning are effective.

Abstract

Several closed-source LLMs have consistently outperformed open-source alternatives in program repair tasks, primarily due to their superior reasoning capabilities and extensive pre-training. This paper introduces Repairity, a novel three-stage methodology that significantly narrows this performance gap through reasoning extraction and reinforcement learning. Our approach: (1) systematically filters high-quality reasoning traces from closed-source models using correctness verification, (2) transfers this reasoning knowledge to open-source models via supervised fine-tuning, and (3) develops reinforcement learning with LLM-based feedback to further optimize performance. Empirical evaluation across multiple program repair benchmarks demonstrates that Repairity improves the performance of Qwen2.5-Coder-32B-Instruct, a base open source LLM, by 8.68\% on average, reducing the capability gap…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Software Testing and Debugging Techniques · Teaching and Learning Programming