DebugHarness: Emulating Human Dynamic Debugging for Autonomous Program Repair

Maolin Sun; Yibiao Yang; Xuanlin Liu; Yuming Zhou; Baowen Xu

arXiv:2604.03610·cs.SE·April 7, 2026

DebugHarness: Emulating Human Dynamic Debugging for Autonomous Program Repair

Maolin Sun, Yibiao Yang, Xuanlin Liu, Yuming Zhou, Baowen Xu

PDF

TL;DR

DebugHarness is an autonomous debugging agent that emulates human dynamic debugging practices to effectively diagnose and repair complex security vulnerabilities in software, outperforming existing static analysis methods.

Contribution

It introduces a novel LLM-powered debugging approach that actively interacts with runtime environments, enabling dynamic diagnosis and repair of low-level memory safety bugs.

Findings

01

DebugHarness patches about 90% of bugs in SEC-bench dataset.

02

It achieves over 30% improvement over state-of-the-art baselines.

03

Dynamic debugging significantly enhances LLM diagnostic capabilities.

Abstract

Patching severe security flaws in complex software remains a major challenge. While automated tools like fuzzers efficiently discover bugs, fixing deep-rooted low-level faults (e.g., use-after-free and memory corruption) still requires labor-intensive manual analysis by experts. Emerging Large Language Model (LLM) agents attempt to automate this pipeline, but they typically treat bug fixing as a purely static code-generation task. Relying solely on static artifacts, these methods miss the dynamic execution context strictly necessary for diagnosing intricate memory safety violations. To overcome these limitations, we introduce DebugHarness, an autonomous LLM-powered debugging agent harness that resolves complex vulnerabilities by emulating the interactive debugging practices of human systems engineers. Instead of merely examining static code, DebugHarness actively queries the live…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.