Project Prometheus: Bridging the Intent Gap in Agentic Program Repair via Reverse-Engineered Executable Specifications

Yongchao Wang; Zhiqiu Huang

arXiv:2604.17464·cs.SE·April 21, 2026

Project Prometheus: Bridging the Intent Gap in Agentic Program Repair via Reverse-Engineered Executable Specifications

Yongchao Wang, Zhiqiu Huang

PDF

TL;DR

Prometheus introduces a specification inference framework for automated program repair that significantly improves correctness and rescue rates by aligning repairs with executable specifications derived from runtime failure reports.

Contribution

It proposes a novel multi-agent framework that reverse-engineers executable specifications to bridge the intent gap in program repair, outperforming existing methods.

Findings

01

Achieved 93.97% correct patch rate on Defects4J.

02

Rescued 74.4% of complex bugs that blind agents failed to fix.

03

Explicit intent guidance reduces over-engineering and improves repair precision.

Abstract

The transition from neural machine translation to agentic workflows has revolutionized Automated Program Repair (APR). However, existing agents, despite their advanced reasoning capabilities, frequently suffer from the ``Intent Gap'' -- the misalignment between the generated patch and the developer's original intent. Current solutions relying on natural language summaries or adversarial sampling often fail to provide the deterministic constraints required for surgical repairs. In this paper, we introduce \textsc{Prometheus}, a novel framework that bridges this gap by prioritizing \textit{Specification Inference} over code generation. We employ Behavior-Driven Development (BDD) as an executable contract, utilizing a multi-agent architecture to reverse-engineer Gherkin specifications from runtime failure reports. To resolve the ``Hallucination of Intent,'' we propose a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.