VERCATION: Precise Vulnerable Open-source Software Version Identification based on Static Analysis and LLM
Yiran Cheng, Ting Zhang, Lwin Khin Shar, Shouguo Yang, Chaopeng Dong, David Lo, Shichao Lv, Zhiqiang Shi, Limin Sun

TL;DR
VERCATION is a novel method that combines static analysis and large language models to accurately identify vulnerable open-source software versions, improving precision over existing techniques.
Contribution
The paper introduces VERCATION, a new approach that leverages program slicing, LLMs, and advanced code clone detection to precisely identify vulnerable OSS versions, addressing limitations of prior methods.
Findings
Achieves 93.1% F1 score on curated dataset
Detects 202 incorrect vulnerable versions in NVD reports
Outperforms state-of-the-art methods in accuracy
Abstract
Open-source software (OSS) has experienced a surge in popularity, attributed to its collaborative development model and cost-effective nature. However, the adoption of specific software versions in development projects may introduce security risks when these versions bring along vulnerabilities. Current methods of identifying vulnerable versions typically analyze and extract the code features involved in vulnerability patches using static analysis with pre-defined rules. They then use code clone detection to identify the vulnerable versions. These methods are hindered by imprecision due to (1) the exclusion of vulnerability-irrelevant code in the analysis and (2) the inadequacy of code clone detection. This paper presents VERCATION, an approach designed to identify vulnerable versions of OSS written in C/C++. VERCATION combines program slicing with a Large Language Model (LLM) to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSecurity and Verification in Computing · Advanced Malware Detection Techniques · Web Application Security Vulnerabilities
