AgentStepper: Interactive Debugging of Software Development Agents
Robert Hutter, Michael Pradel

TL;DR
AgentStepper is an interactive debugger designed for LLM-based software development agents, enabling developers to better understand, control, and debug complex agent trajectories through structured, high-level interactions.
Contribution
This paper introduces AgentStepper, the first interactive debugging tool for LLM-powered software agents, supporting structured inspection, control, and live editing of agent trajectories.
Findings
AgentStepper requires minimal code modifications (39-42 lines) to existing agents.
User study shows improved bug detection success rate from 17% to 60%.
Participants reported reduced workload and frustration when using AgentStepper.
Abstract
Software development agents powered by large language models (LLMs) have shown great promise in automating tasks like environment setup, issue solving, and program repair. Unfortunately, understanding and debugging such agents remain challenging due to their complex and dynamic nature. Developers must reason about trajectories of LLM queries, tool calls, and code modifications, but current techniques reveal little of this intermediate process in a comprehensible format. The key insight of this paper is that debugging software development agents shares many similarities with conventional debugging of software programs, yet requires a higher level of abstraction that raises the level from low-level implementation details to high-level agent actions. Drawing on this insight, we introduce AgentStepper, the first interactive debugger for LLM-based software engineering agents. AgentStepper…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Techniques and Practices · Software Engineering Research · Multi-Agent Systems and Negotiation
