Empowering Autonomous Debugging Agents with Efficient Dynamic Analysis
Jiahong Xiang, Xiaoyang Xu, Xiaopan Chu, Hongliang Tian, Yuqun Zhang

TL;DR
This paper introduces ADI, a cost-efficient, function-level debugging interface for autonomous agents, significantly improving their effectiveness and efficiency in program repair tasks.
Contribution
The paper presents ADI, a novel agent-centric debugging interface with a function-level interaction paradigm, enhancing autonomous agents' performance and cost-efficiency.
Findings
ADI resolves 63.8% of tasks on SWE-bench, outperforming some existing agents.
ADI achieves an average cost of USD 1.28 per task.
Integrating ADI into existing agents yields 6.2% to 18.5% performance gains.
Abstract
Autonomous agents for automated program repair represent a promising frontier in software engineering, yet their effectiveness is often hindered by reliance on post-mortem, coarse-grained execution feedback. While integrating traditional interactive debuggers seems a natural solution, their low-level, line-by-line interaction paradigm turns out to be cost-inefficient for LLM-based agents, leading to exhausted budgets and unproductive loops. To mitigate this, we introduce Agent-centric Debugging Interface (ADI), a novel agent-centric debugging interface designed for cost-efficient, end-to-end autonomous interaction. Specifically, Agent-centric Debugging Interface realizes a function-level interaction paradigm, powered by our Frame Lifetime Trace, a comprehensive data structure encapsulating a function's stateful execution trace, and a set of high-level navigational commands. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
