Input Reduction Enhanced LLM-based Program Repair
Boyang Yang, Luyao Ren, Xin Yin, Jiadong Ren, Haoye Tian, Shunfu Jin

TL;DR
This paper introduces ReduceFix, a method that automatically reduces large failure-inducing test inputs to improve the effectiveness of LLM-based automated program repair, demonstrating significant performance gains on a new long-input benchmark.
Contribution
The paper proposes ReduceFix, a novel input reduction approach that enhances LLM-based program repair by minimizing test inputs without human effort, and introduces LFTBench, a benchmark for long-input APR with real bugs.
Findings
ReduceFix shrinks inputs by 89.1% on average.
Improves pass@10 by up to 53.8% with input reduction.
Enhances repair success of ChatRepair and CREF by 21.3% and 2.6%.
Abstract
Large Language Models (LLMs) have shown great potential in Automated Program Repair (APR). Test inputs, being crucial for reasoning the root cause of failures, are always included in the prompt for LLM-based APR. Unfortunately, LLMs struggle to retain key information in long prompts. When the test inputs are extensive in the prompt, this may trigger the "lost-in-the-middle" issue, compromising repair performance. ReduceFix prompts an LLM to generate a reducer that minimizes failure-inducing test inputs without human effort, and then feeds the reduced failure-inducing inputs to guide patch generation. For targeted evaluation, we constructed LFTBench, the first long-input APR benchmark with 200 real bugs from 20 programming tasks, each paired with a failure-inducing input whose median size is 1 MB. On this benchmark, ReduceFix shrinks inputs by 89.1% on average and improves overall…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Software System Performance and Reliability · Software Engineering Research
