LLMs in Code Vulnerability Analysis: A Proof of Concept
Shaznin Sultana, Sadia Afreen, Nasir U. Eisty

TL;DR
This paper investigates the use of large language models for automating software vulnerability detection and repair, demonstrating that fine-tuning enhances performance and that code-specific models excel in complex tasks.
Contribution
It provides a comprehensive evaluation of recent LLMs for vulnerability analysis, comparing fine-tuning and prompt-based methods on benchmark datasets.
Findings
Fine-tuning outperforms zero-shot and few-shot approaches.
Code-specialized models excel in complex tasks.
Current metrics are inadequate for measuring repair quality.
Abstract
Context: Traditional software security analysis methods struggle to keep pace with the scale and complexity of modern codebases, requiring intelligent automation to detect, assess, and remediate vulnerabilities more efficiently and accurately. Objective: This paper explores the incorporation of code-specific and general-purpose Large Language Models (LLMs) to automate critical software security tasks, such as identifying vulnerabilities, predicting severity and access complexity, and generating fixes as a proof of concept. Method: We evaluate five pairs of recent LLMs, including both code-based and general-purpose open-source models, on two recognized C/C++ vulnerability datasets, namely Big-Vul and Vul-Repair. Additionally, we compare fine-tuning and prompt-based approaches. Results: The results show that fine-tuning uniformly outperforms both zero-shot and few-shot approaches across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Information and Cyber Security · Web Application Security Vulnerabilities
