PoC-Gym: Towards More Reliable LLM-Assisted Proof-of-Concept Exploit Generation

Derin Gezgin; Amartya Das; Shinhae Kim; Zhengdong Huang; Nevena Stojkovic; Claire Wang

arXiv:2602.04165·cs.SE·May 14, 2026

PoC-Gym: Towards More Reliable LLM-Assisted Proof-of-Concept Exploit Generation

Derin Gezgin, Amartya Das, Shinhae Kim, Zhengdong Huang, Nevena Stojkovic, Claire Wang

PDF

TL;DR

PoC-Gym is a pipeline that enhances the reliability of LLM-assisted Java PoC exploit generation by combining static and dynamic validation techniques, achieving promising results on 20 CVEs.

Contribution

It introduces a novel iterative PoC generation pipeline using static and dynamic info, improving validation reliability for LLM-generated exploits.

Findings

01

116 candidates passed runtime validation out of 338 runs

02

65 candidates were validated against ground-truth locations

03

PoC-Gym covered 12 of 20 CVEs in evaluation

Abstract

Recently Large Language Models (LLMs) have been used in security-related tasks, including generating proof-of-concept (PoC) exploits. Several LLM-assisted approaches have been proposed; they typically generate PoCs from vulnerability descriptions and use additional guidance. But, such approaches are often ineffective because the signals-such as printed markers, generated files, or runtime side effects-that they use for validation may not imply that the vulnerability is triggered. Research for more reliable PoC generation is in need but yet remains challenging. We propose PoC-Gym, a pipeline for LLM-based PoC generation for Java security vulnerabilities. PoC-Gym uses both static and dynamic information, e.g., CVE-tailored prompts, static traces, and coverage-based feedback, and iteratively generates PoC candidates. Each candidate goes through a series of validations: whether the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.