Augmenting Smart Contract Decompiler Output through Fine-grained Dependency Analysis and LLM-facilitated Semantic Recovery
Zeqin Liao, Yuhong Nan, Zixu Gao, Henglong Liang, Sicheng Hao, Peifan Reng, Zibin Zheng

TL;DR
SmartHalo enhances smart contract decompiler outputs by combining static analysis and large language models, significantly improving accuracy in reconstructing source code features like method boundaries and variable types.
Contribution
The paper introduces SmartHalo, a novel framework that integrates static analysis with LLMs to improve decompiler accuracy for Solidity smart contracts.
Findings
Achieves 87.39% precision in method boundary detection.
Reaches 90.39% accuracy in variable type recovery.
Improves overall decompilation quality compared to state-of-the-art tools.
Abstract
Decompiler is a specialized type of reverse engineering tool extensively employed in program analysis tasks, particularly in program comprehension and vulnerability detection. However, current Solidity smart contract decompilers face significant limitations in reconstructing the original source code. In particular, the bottleneck of SOTA decompilers lies in inaccurate method identification, incorrect variable type recovery, and missing contract attributes. These deficiencies hinder downstream tasks and understanding of the program logic. To address these challenges, we propose SmartHalo, a new framework that enhances decompiler output by combining static analysis (SA) and large language models (LLM). SmartHalo leverages the complementary strengths of SA's accuracy in control and data flow analysis and LLM's capability in semantic prediction. More specifically, \system{} constructs a new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law · Digital Rights Management and Security · Multi-Agent Systems and Negotiation
