Code Change Intention, Development Artifact and History Vulnerability:   Putting Them Together for Vulnerability Fix Detection by LLM

Xu Yang; Wenhan Zhu; Michael Pacheco; Jiayuan Zhou; Shaowei Wang; Xing; Hu; Kui Liu

arXiv:2501.14983·cs.SE·January 28, 2025

Code Change Intention, Development Artifact and History Vulnerability: Putting Them Together for Vulnerability Fix Detection by LLM

Xu Yang, Wenhan Zhu, Michael Pacheco, Jiayuan Zhou, Shaowei Wang, Xing, Hu, Kui Liu

PDF

Open Access

TL;DR

This paper introduces LLM4VFD, a novel framework utilizing Large Language Models with Chain-of-Thought reasoning and contextual artifact analysis to significantly improve vulnerability fix detection accuracy and provide explanations, surpassing existing methods.

Contribution

The paper presents LLM4VFD, integrating code change analysis, development artifact context, and historical vulnerability data with advanced LLM techniques for enhanced vulnerability fix detection.

Findings

01

LLM4VFD outperforms existing approaches by 68.1%-145.4%.

02

It provides detailed explanations to aid security experts.

03

User studies show improved efficiency in vulnerability fix identification.

Abstract

Detecting vulnerability fix commits in open-source software is crucial for maintaining software security. To help OSS identify vulnerability fix commits, several automated approaches are developed. However, existing approaches like VulFixMiner and CoLeFunDa, focus solely on code changes, neglecting essential context from development artifacts. Tools like Vulcurator, which integrates issue reports, fail to leverage semantic associations between different development artifacts (e.g., pull requests and history vulnerability fixes). Moreover, they miss vulnerability fixes in tangled commits and lack explanations, limiting practical use. Hence to address those limitations, we propose LLM4VFD, a novel framework that leverages Large Language Models (LLMs) enhanced with Chain-of-Thought reasoning and In-Context Learning to improve the accuracy of vulnerability fix detection. LLM4VFD comprises…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsWeb Application Security Vulnerabilities · Information and Cyber Security · Software Engineering Research