ANVIL: Anomaly-based Vulnerability Identification without Labelled Training Data

Weizhou Wang; Eric Liu; Xiangyu Guo; Xiao Hu; Ilya Grishchenko; David Lie

arXiv:2408.16028·cs.CR·June 3, 2025

ANVIL: Anomaly-based Vulnerability Identification without Labelled Training Data

Weizhou Wang, Eric Liu, Xiangyu Guo, Xiao Hu, Ilya Grishchenko, David Lie

PDF

Open Access

TL;DR

ANVIL introduces an anomaly detection approach using LLMs for vulnerability identification in code, outperforming supervised methods and uncovering new vulnerabilities without requiring labeled training data.

Contribution

This paper presents ANVIL, a novel anomaly-based vulnerability detection method leveraging LLMs' reconstruction capabilities, eliminating the need for labeled training data and demonstrating superior performance.

Findings

01

ANVIL outperforms state-of-the-art supervised detectors on PrimeVul dataset.

02

ANVIL achieves up to 2x higher Top-3 accuracy and 75% better Normalized MFR.

03

ANVIL uncovers two previously unknown vulnerabilities when integrated with fuzzers.

Abstract

Supervised-learning-based vulnerability detectors often fall short due to limited labelled training data. In contrast, Large Language Models (LLMs) like GPT-4 are trained on vast unlabelled code corpora, yet perform only marginally better than coin flips when directly prompted to detect vulnerabilities. In this paper, we reframe vulnerability detection as anomaly detection, based on the premise that vulnerable code is rare and thus anomalous relative to patterns learned by LLMs. We introduce ANVIL, which performs a masked code reconstruction task: the LLM reconstructs a masked line of code, and deviations from the original are scored as anomalies. We propose a hybrid anomaly score that combines exact match, cross-entropy loss, prediction confidence, and structural complexity. We evaluate our approach across multiple LLM families, scoring methods, and context sizes, and against…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Network Security and Intrusion Detection · Software System Performance and Reliability

MethodsMeta Face Recognition · Linear Layer · Adam · Layer Normalization · Attention Is All You Need · Position-Wise Feed-Forward Layer · Dense Connections · Residual Connection · Multi-Head Attention · Byte Pair Encoding