Project Prometheus: Bridging the Intent Gap in Agentic Program Repair via Reverse-Engineered Executable Specifications
Yongchao Wang, Zhiqiu Huang

TL;DR
Prometheus introduces a specification inference framework for automated program repair that significantly improves correctness and rescue rates by aligning repairs with executable specifications derived from runtime failure reports.
Contribution
It proposes a novel multi-agent framework that reverse-engineers executable specifications to bridge the intent gap in program repair, outperforming existing methods.
Findings
Achieved 93.97% correct patch rate on Defects4J.
Rescued 74.4% of complex bugs that blind agents failed to fix.
Explicit intent guidance reduces over-engineering and improves repair precision.
Abstract
The transition from neural machine translation to agentic workflows has revolutionized Automated Program Repair (APR). However, existing agents, despite their advanced reasoning capabilities, frequently suffer from the ``Intent Gap'' -- the misalignment between the generated patch and the developer's original intent. Current solutions relying on natural language summaries or adversarial sampling often fail to provide the deterministic constraints required for surgical repairs. In this paper, we introduce \textsc{Prometheus}, a novel framework that bridges this gap by prioritizing \textit{Specification Inference} over code generation. We employ Behavior-Driven Development (BDD) as an executable contract, utilizing a multi-agent architecture to reverse-engineer Gherkin specifications from runtime failure reports. To resolve the ``Hallucination of Intent,'' we propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
