D-LiFT: Improving LLM-based Decompiler Backend via Code Quality-driven Fine-tuning
Muqi Zou, Hongyu Cai, Hongwei Wu, Zion Leonahenahe Basque, Arslan Khan, Berkay Celik, Dave (Jing) Tian, Antonio Bianchi, Ruoyu (Fish) Wang, Dongyan Xu

TL;DR
D-LIFT enhances LLM-based decompilers by fine-tuning with a code quality-aware reinforcement learning approach, significantly improving the readability and accuracy of decompiled code while preserving correctness.
Contribution
The paper introduces D-LIFT, a novel fine-tuning method for LLM-based decompilers that uses a new code quality assessment system, D-Score, to improve output quality without sacrificing accuracy.
Findings
55.3% more improved decompiled functions with D-LIFT
Improves 68.2% of all decompiled functions
Demonstrates significant quality improvements on coreutils and util-linux projects
Abstract
As one of the key tools in many security tasks, decompilers reconstruct human-readable source code from binaries. Yet, despite recent advances, their outputs often suffer from syntactic and semantic errors and remain difficult to read. Recently, with the advent of large language models (LLMs), researchers began to explore the potential of LLMs to refine decompiler output. Nevertheless, our study of these approaches reveals their problems, such as introducing new errors and relying on unreliable accuracy validation. In this paper, we present D-LIFT, an enhanced decompiler-LLM pipeline with a fine-tuned LLM using code quality-aware reinforcement learning. Unlike prior work that overlooks preserving accuracy, D-LIFT adheres to a key principle for enhancing the quality of decompiled code: preserving accuracy while improving readability. Central to D-LIFT, we propose D-Score, an integrated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Advanced Malware Detection Techniques · Security and Verification in Computing
