Summary-Mediated Repair: Can LLMs use code summarisation as a tool for program repair?
Lukas Twist

TL;DR
This paper introduces summary-mediated repair, a novel approach using code summarisation as an intermediate step to improve program repair in large language models, showing modest but consistent gains across multiple models and benchmarks.
Contribution
The paper proposes a prompt-only pipeline that leverages natural-language code summarisation for program repair, extending previous work on summarisation as an intermediary for downstream tasks.
Findings
Error-aware summaries repair up to 65% of unseen errors.
Summaries improve repair performance by an average of 5% over baseline.
The approach is effective across eight production-grade LLMs and two benchmarks.
Abstract
Large Language Models (LLMs) often produce code with subtle implementation-level bugs despite strong benchmark performance. These errors are hard for LLMs to spot and can have large behavioural effects; yet when asked to summarise code, LLMs can frequently surface high-level intent and sometimes overlook this low-level noise. Motivated by this, we propose summary-mediated repair, a prompt-only pipeline for program repair that leverages natural-language code summarisation as an explicit intermediate step, extending previous work that has already shown code summarisation to be a useful intermediary for downstream tasks. We evaluate our method across eight production-grade LLMs on two function level benchmarks (HumanEvalPack and MBPP), comparing several summary styles against a direct repair baseline. Error-aware diagnostic summaries consistently yield the largest gains - repairing up to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Software System Performance and Reliability
