Constraint-Guided Multi-Agent Decompilation for Executable Binary Recovery
Yifan Zhang, Xiaohan Wang, Yueke Zhang, Yu Huang, Kevin Leach

TL;DR
This paper introduces a multi-agent, constraint-guided decompilation framework that significantly improves the re-executability of binary code by iteratively refining decompiled source using hierarchical validation and LLM feedback.
Contribution
It presents a novel multi-agent system employing hierarchical constraints and LLMs to enhance decompiled code's correctness and executability, outperforming existing methods.
Findings
Achieves 84-97% re-executability on real-world binaries.
Outperforms state-of-the-art LLM-based decompilation methods.
Execution-based validation is critical for behavioral correctness.
Abstract
Decompilation -- recovering source code from compiled binaries -- is essential for security analysis, malware reverse engineering, and legacy software maintenance. However, existing decompilers produce code that often fails to compile or execute correctly, limiting their practical utility. We present a multi-agent framework that transforms decompiled code into re-executable source through Multi-level Constraint-Guided Decompilation (MCGD). Our approach employs a hierarchical validation pipeline with three constraint levels: (1) syntactic correctness via parsing, (2) compilability via GCC, and (3) behavioral equivalence via LLM-generated test cases. When validation fails, specialized LLM agents iteratively refine the code using structured error feedback. We evaluate our framework on 1,641 real-world binaries from ExeBench across three decompilers (RetDec, Ghidra, and Angr). Our framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
