PatchPilot: A Cost-Efficient Software Engineering Agent with Early Attempts on Formal Verification
Hongwei Li, Yuheng Tang, Shiqi Wang, Wenbo Guo

TL;DR
PatchPilot is a novel rule-based software patching agent that balances efficacy, stability, and cost-efficiency, outperforming existing methods on SWE-bench with low cost and high stability.
Contribution
We introduce PatchPilot, a rule-based patching workflow with novel components and designs that improve patching performance while maintaining low cost and high stability.
Findings
Outperforms existing open-source patching methods on SWE-bench
Maintains low cost under $1 per instance
Ensures higher stability compared to prior approaches
Abstract
Recent research builds various patching agents that combine large language models (LLMs) with non-ML tools and achieve promising results on the state-of-the-art (SOTA) software patching benchmark, SWE-bench. Based on how to determine the patching workflows, existing patching agents can be categorized as agent-based planning methods, which rely on LLMs for planning, and rule-based planning methods, which follow a pre-defined workflow. At a high level, agent-based planning methods achieve high patching performance but with a high cost and limited stability. Rule-based planning methods, on the other hand, are more stable and efficient but have key workflow limitations that compromise their patching performance. In this paper, we propose PatchPilot, an agentic patcher that strikes a balance between patching efficacy, stability, and cost-efficiency. PatchPilot proposes a novel rule-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModular Robots and Swarm Intelligence · Peer-to-Peer Network Technologies
MethodsActivation Patching
