PoC-Gym: Towards More Reliable LLM-Assisted Proof-of-Concept Exploit Generation
Derin Gezgin, Amartya Das, Shinhae Kim, Zhengdong Huang, Nevena Stojkovic, Claire Wang

TL;DR
PoC-Gym is a pipeline that enhances the reliability of LLM-assisted Java PoC exploit generation by combining static and dynamic validation techniques, achieving promising results on 20 CVEs.
Contribution
It introduces a novel iterative PoC generation pipeline using static and dynamic info, improving validation reliability for LLM-generated exploits.
Findings
116 candidates passed runtime validation out of 338 runs
65 candidates were validated against ground-truth locations
PoC-Gym covered 12 of 20 CVEs in evaluation
Abstract
Recently Large Language Models (LLMs) have been used in security-related tasks, including generating proof-of-concept (PoC) exploits. Several LLM-assisted approaches have been proposed; they typically generate PoCs from vulnerability descriptions and use additional guidance. But, such approaches are often ineffective because the signals-such as printed markers, generated files, or runtime side effects-that they use for validation may not imply that the vulnerability is triggered. Research for more reliable PoC generation is in need but yet remains challenging. We propose PoC-Gym, a pipeline for LLM-based PoC generation for Java security vulnerabilities. PoC-Gym uses both static and dynamic information, e.g., CVE-tailored prompts, static traces, and coverage-based feedback, and iteratively generates PoC candidates. Each candidate goes through a series of validations: whether the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
