VulInstruct: Teaching LLMs Root-Cause Reasoning for Vulnerability Detection via Security Specifications

Hao Zhu; Jia Li; Cuiyun Gao; Jiaru Qian; Yihong Dong; Huanyu Liu; Lecheng Wang; Ziliang Wang; Xiaolong Hu; Ge Li

arXiv:2511.04014·cs.SE·March 31, 2026

VulInstruct: Teaching LLMs Root-Cause Reasoning for Vulnerability Detection via Security Specifications

Hao Zhu, Jia Li, Cuiyun Gao, Jiaru Qian, Yihong Dong, Huanyu Liu, Lecheng Wang, Ziliang Wang, Xiaolong Hu, Ge Li

PDF

1 Repo

TL;DR

VulInstruct enhances large language models' ability to detect software vulnerabilities by leveraging security specifications extracted from historical data, significantly improving detection performance and discovering new vulnerabilities.

Contribution

The paper introduces VulInstruct, a specification-guided method that enables LLMs to reason about security behaviors, improving vulnerability detection accuracy and uncovering previously unknown flaws.

Findings

01

VulInstruct achieves 45.0% F1-score on PrimeVul, a 32.7% improvement over baselines.

02

It detects 24.3% of vulnerabilities, 2.4 times more than existing methods.

03

Discovered a new high-severity vulnerability (CVE-2025-56538) in production code.

Abstract

Large language models (LLMs) have achieved remarkable progress in code understanding tasks. However, they demonstrate limited performance in vulnerability detection and struggle to distinguish vulnerable code from patched code. We argue that LLMs lack understanding of security specifications -- the expectations about how code should behave to remain safe. When code behavior differs from these expectations, it becomes a potential vulnerability. However, such knowledge is rarely explicit in training data, leaving models unable to reason about security flaws. We propose VulInstruct, a specification-guided approach that systematically extracts security specifications from historical vulnerabilities to detect new ones. VulInstruct constructs a specification knowledge base from two perspectives: (i) General specifications from high-quality patches across projects, capturing fundamental safe…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhuhaopku/VulInstruct-temp
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.