Patch-to-PoC: A Systematic Study of Agentic LLM Systems for Linux Kernel N-Day Reproduction

Juefei Pu; Xingyu Li; Zhengchuan Liang; Jonathan Cox; Yifan Wu; Kareem Shehada; Arrdya Srivastav; Zhiyun Qian

arXiv:2602.07287·cs.CR·February 20, 2026

Patch-to-PoC: A Systematic Study of Agentic LLM Systems for Linux Kernel N-Day Reproduction

Juefei Pu, Xingyu Li, Zhengchuan Liang, Jonathan Cox, Yifan Wu, Kareem Shehada, Arrdya Srivastav, Zhiyun Qian

PDF

Open Access

TL;DR

This paper systematically evaluates an LLM-based system, K-Repro, for autonomously reproducing Linux kernel vulnerabilities, demonstrating over 50% success in generating proof-of-concept exploits for real-world N-day vulnerabilities.

Contribution

It introduces K-Repro, the first large-scale LLM-based system capable of reproducing Linux kernel N-day vulnerabilities with detailed analysis of its effectiveness and limitations.

Findings

01

Reproduces over 50% of vulnerabilities in the dataset

02

Identifies key factors influencing success and failure

03

Provides insights for improving autonomous security agents

Abstract

Autonomous large language model (LLM) based systems have recently shown promising results across a range of cybersecurity tasks. However, there is no systematic study on their effectiveness in autonomously reproducing Linux kernel vulnerabilities with concrete proofs-of-concept (PoCs). Owing to the size, complexity, and low-level nature of the Linux kernel, such tasks are widely regarded as particularly challenging for current LLM-based approaches. In this paper, we present the first large-scale study of LLM-based Linux kernel vulnerability reproduction. For this purpose, we develop K-Repro, an LLM-based agentic system equipped with controlled code-browsing, virtual machine management, interaction, and debugging capabilities. Using kernel security patches as input, K-Repro automates end-to-end bug reproduction of N-day vulnerabilities in the Linux kernel. On a dataset of 100…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSecurity and Verification in Computing · Advanced Malware Detection Techniques · Web Application Security Vulnerabilities