Runtime Execution Traces Guided Automated Program Repair with Multi-Agent Debate
Jiaqing Wu, Tong Wu, Manqing Zhang, Yunwei Dong, Bo Shen

TL;DR
TraceRepair is a multi-agent framework that uses runtime execution traces as shared constraints to improve automated program repair, significantly outperforming existing LLM-based methods on benchmark datasets.
Contribution
It introduces a novel multi-agent approach leveraging runtime facts as shared constraints for more accurate and dynamic program repair.
Findings
Correctly fixed 392 defects on Defects4J benchmark
Outperformed existing LLM-based approaches significantly
Demonstrated improved efficiency and generalization on new bug datasets
Abstract
Automated Program Repair (APR) struggles with complex logic errors and silent failures. Current LLM-based APR methods are mostly static, relying on source code and basic test outputs, which fail to accurately capture complex runtime behaviors and dynamic data dependencies. While incorporating runtime evidence like execution traces exposes concrete state transitions, a single LLM interpreting this in isolation often overfits to specific hypotheses, producing patches that satisfy tests by coincidence rather than correct logic. Therefore, runtime evidence should act as objective constraints rather than mere additional input. We propose TraceRepair, a multi-agent framework that leverages runtime facts as shared constraints for patch validation. A probe agent captures execution snapshots of critical variables to form an objective repair basis. Meanwhile, a committee of specialized agents…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
