PCodeTrans: Translate Decompiled Pseudocode to Compilable and Executable Equivalent
Yuxin Cui, Zeyu Gao, Shuxian He, Siliang Qin, Chao Zhang

TL;DR
PCodeTrans is a feedback-driven framework that translates decompiled pseudocode into compilable, executable code with high behavioral fidelity, enabling reliable binary analysis and software modernization.
Contribution
It introduces a novel, dynamic validation-guided approach that ensures decompiled code is both syntactically correct and behaviorally consistent with the original binary.
Findings
Achieves 100% function-level compilability on unstripped binaries.
Attains over 99.5% behavioral consistency in test validation.
Resolves over 76% of logic errors in official test suites.
Abstract
Decompilation is foundational to binary analysis, yet conventional tools prioritize human readability over strict recompilability and verifiable runtime correctness. While recent LLM-based approaches attempt to refine decompiled pseudocode, they typically either optimize solely for readability or rely on static analysis for evaluation. This makes them prone to "semantic hallucinations" that compromise accuracy and fail to resolve actual runtime failures. For critical tasks like software modernization and vulnerability remediation, recovered code must not only compile but replicate the original binary's behavior. We present PCodeTrans, a feedback-driven framework that bridges the gap between decompilation, recompilation, and rigorous function-level dynamic validation. After extracting a minimal yet coherent context to guarantee recompilability, PCodeTrans employs an in situ substitutable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSecurity and Verification in Computing · Advanced Malware Detection Techniques · Logic, programming, and type systems
