Smart-LLaMA-DPO: Reinforced Large Language Model for Explainable Smart Contract Vulnerability Detection

Lei Yu; Zhirong Huang; Hang Yuan; Shiqi Cheng; Li Yang; Fengjun Zhang; Chenjie Shen; Jiajia Ma; Jingyuan Zhang; Junyi Lu; Chun Zuo

arXiv:2506.18245·cs.CR·June 24, 2025

Smart-LLaMA-DPO: Reinforced Large Language Model for Explainable Smart Contract Vulnerability Detection

Lei Yu, Zhirong Huang, Hang Yuan, Shiqi Cheng, Li Yang, Fengjun Zhang, Chenjie Shen, Jiajia Ma, Jingyuan Zhang, Junyi Lu, Chun Zuo

PDF

1 Models

TL;DR

This paper introduces Smart-LLaMA-DPO, a reinforced large language model tailored for explainable smart contract vulnerability detection, addressing dataset limitations and interpretation issues in blockchain security.

Contribution

It develops a comprehensive dataset, applies continual pre-training, supervised fine-tuning, and direct preference optimization to enhance LLM performance in vulnerability detection and explanation quality.

Findings

01

Significant improvement in detection accuracy and F1 score.

02

More accurate, thorough, and clear explanations from the model.

03

Outperforms state-of-the-art baselines in multiple vulnerability types.

Abstract

Smart contract vulnerability detection remains a major challenge in blockchain security. Existing vulnerability detection methods face two main issues: (1) Existing datasets lack comprehensive coverage and high-quality explanations for preference learning. (2) Large language models (LLMs) often struggle with accurately interpreting specific concepts in smart contract security. Empirical analysis shows that even after continual pre-training (CPT) and supervised fine-tuning (SFT), LLMs may misinterpret the execution order of state changes, resulting in incorrect explanations despite making correct detection decisions. To address these challenges, we propose Smart-LLaMA-DPO based on LLaMA-3.1-8B. We construct a comprehensive dataset covering four major vulnerability types and machine-unauditable vulnerabilities, including precise labels, explanations, and locations for SFT, as well as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
nothingisenough/solidity-smart-llama-base
model· 25 dl
25 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDirect Preference Optimization · Shrink and Fine-Tune