Prompt to Pwn: Automated Exploit Generation for Smart Contracts

ZeKe Xiao; Qin Wang; Yuekang Li; Shiping Chen

arXiv:2508.01371·cs.CR·April 22, 2026

Prompt to Pwn: Automated Exploit Generation for Smart Contracts

ZeKe Xiao, Qin Wang, Yuekang Li, Shiping Chen

PDF

TL;DR

This paper introduces ReX, an execution-grounded framework that leverages large language models to automate exploit generation for smart contracts, evaluating their effectiveness across various vulnerabilities.

Contribution

It presents ReX, a novel end-to-end system integrating LLMs with smart contract testing, and provides a comprehensive evaluation of LLM capabilities in exploit synthesis.

Findings

01

LLMs can generate deterministic PoCs for single-contract vulnerabilities.

02

Performance varies significantly by model and bug type.

03

Current LLMs are less effective for cross-contract attacks.

Abstract

Smart contracts are important for digital finance, yet they are hard to patch once deployed. Prior work has mainly explored LLMs for smart contract vulnerability detection, leaving end-to-end automated exploit generation (AEG) much less understood. We study that gap with \textsc{ReX}, an execution-grounded framework that links LLM-based exploit synthesis to the Foundry stack for end-to-end generation, compilation, execution, and validation. Five recent LLMs are evaluated across eight common vulnerability classes, supported by a curated dataset of 38{+} real incident PoCs and three automation aids: prompt refactoring, a compiler feedback loop, and templated test harnesses. Results indicate that current frontier LLMs can often produce deterministic PoCs for single-contract vulnerabilities, but remain weak on cross-contract attacks; outcomes depend mainly on the model and bug type, while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.