Input Reduction Enhanced LLM-based Program Repair

Boyang Yang; Luyao Ren; Xin Yin; Jiadong Ren; Haoye Tian; Shunfu Jin

arXiv:2507.15251·cs.SE·December 19, 2025

Input Reduction Enhanced LLM-based Program Repair

Boyang Yang, Luyao Ren, Xin Yin, Jiadong Ren, Haoye Tian, Shunfu Jin

PDF

Open Access

TL;DR

This paper introduces ReduceFix, a method that automatically reduces large failure-inducing test inputs to improve the effectiveness of LLM-based automated program repair, demonstrating significant performance gains on a new long-input benchmark.

Contribution

The paper proposes ReduceFix, a novel input reduction approach that enhances LLM-based program repair by minimizing test inputs without human effort, and introduces LFTBench, a benchmark for long-input APR with real bugs.

Findings

01

ReduceFix shrinks inputs by 89.1% on average.

02

Improves pass@10 by up to 53.8% with input reduction.

03

Enhances repair success of ChatRepair and CREF by 21.3% and 2.6%.

Abstract

Large Language Models (LLMs) have shown great potential in Automated Program Repair (APR). Test inputs, being crucial for reasoning the root cause of failures, are always included in the prompt for LLM-based APR. Unfortunately, LLMs struggle to retain key information in long prompts. When the test inputs are extensive in the prompt, this may trigger the "lost-in-the-middle" issue, compromising repair performance. ReduceFix prompts an LLM to generate a reducer that minimizes failure-inducing test inputs without human effort, and then feeds the reduced failure-inducing inputs to guide patch generation. For targeted evaluation, we constructed LFTBench, the first long-input APR benchmark with 200 real bugs from 20 programming tasks, each paired with a failure-inducing input whose median size is 1 MB. On this benchmark, ReduceFix shrinks inputs by 89.1% on average and improves overall…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Testing and Debugging Techniques · Software System Performance and Reliability · Software Engineering Research