LLMs in Code Vulnerability Analysis: A Proof of Concept

Shaznin Sultana; Sadia Afreen; Nasir U. Eisty

arXiv:2601.08691·cs.SE·January 14, 2026

LLMs in Code Vulnerability Analysis: A Proof of Concept

Shaznin Sultana, Sadia Afreen, Nasir U. Eisty

PDF

Open Access

TL;DR

This paper investigates the use of large language models for automating software vulnerability detection and repair, demonstrating that fine-tuning enhances performance and that code-specific models excel in complex tasks.

Contribution

It provides a comprehensive evaluation of recent LLMs for vulnerability analysis, comparing fine-tuning and prompt-based methods on benchmark datasets.

Findings

01

Fine-tuning outperforms zero-shot and few-shot approaches.

02

Code-specialized models excel in complex tasks.

03

Current metrics are inadequate for measuring repair quality.

Abstract

Context: Traditional software security analysis methods struggle to keep pace with the scale and complexity of modern codebases, requiring intelligent automation to detect, assess, and remediate vulnerabilities more efficiently and accurately. Objective: This paper explores the incorporation of code-specific and general-purpose Large Language Models (LLMs) to automate critical software security tasks, such as identifying vulnerabilities, predicting severity and access complexity, and generating fixes as a proof of concept. Method: We evaluate five pairs of recent LLMs, including both code-based and general-purpose open-source models, on two recognized C/C++ vulnerability datasets, namely Big-Vul and Vul-Repair. Additionally, we compare fine-tuning and prompt-based approaches. Results: The results show that fine-tuning uniformly outperforms both zero-shot and few-shot approaches across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Information and Cyber Security · Web Application Security Vulnerabilities