Boosting Neural Networks to Decompile Optimized Binaries
Ying Cao, Ruigang Liang, Kai Chen, Peiwei Hu

TL;DR
This paper introduces NeurDP, a neural decompiler that effectively handles compiler-optimized binaries by using a graph neural network and an intermediate representation, significantly improving decompilation accuracy.
Contribution
NeurDP is the first neural decompiler designed specifically for compiler-optimized binaries, utilizing GNNs and an OTU to enhance translation accuracy.
Findings
NeurDP achieves 45.21% higher accuracy than existing frameworks.
The use of GNNs and OTU improves decompilation of optimized binaries.
Evaluation on diverse datasets confirms its effectiveness.
Abstract
Decompilation aims to transform a low-level program language (LPL) (eg., binary file) into its functionally-equivalent high-level program language (HPL) (e.g., C/C++). It is a core technology in software security, especially in vulnerability discovery and malware analysis. In recent years, with the successful application of neural machine translation (NMT) models in natural language processing (NLP), researchers have tried to build neural decompilers by borrowing the idea of NMT. They formulate the decompilation process as a translation problem between LPL and HPL, aiming to reduce the human cost required to develop decompilation tools and improve their generalizability. However, state-of-the-art learning-based decompilers do not cope well with compiler-optimized binaries. Since real-world binaries are mostly compiler-optimized, decompilers that do not consider optimized binaries have…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGraph Neural Network
