UniDebugger: Hierarchical Multi-Agent Framework for Unified Software Debugging
Cheryl Lee, Chunqiu Steven Xia, Longji Yang, Jen-tse Huang, Zhouruixin Zhu, Lingming Zhang, Michael R. Lyu

TL;DR
UniDebugger introduces a hierarchical multi-agent framework that mimics developer cognition for comprehensive, adaptive software debugging, significantly outperforming existing methods without needing ground-truth root causes.
Contribution
It presents the first end-to-end multi-agent debugging framework that models developer cognition and handles complex bugs adaptively, outperforming state-of-the-art methods.
Findings
FixAgent fixes 1.25x to 2.56x more bugs than baselines.
It operates without requiring ground-truth root-cause code.
The framework is validated on extensive benchmarks.
Abstract
Software debugging is a time-consuming endeavor involving a series of steps, such as fault localization and patch generation, each requiring thorough analysis and a deep understanding of the underlying logic. While large language models (LLMs) demonstrate promising potential in coding tasks, their performance in debugging remains limited. Current LLM-based methods often focus on isolated steps and struggle with complex bugs. In this paper, we propose the first end-to-end framework, FixAgent, for unified debugging through multi-agent synergy. It mimics the entire cognitive processes of developers, with each agent specialized as a particular component of this process rather than mirroring the actions of an independent expert as in previous multi-agent systems. Agents are coordinated through a three-level design, following a cognitive model of debugging, allowing adaptive handling of bugs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Advanced Software Engineering Methodologies · Software Engineering Research
