SCoPE: Evaluating LLMs for Software Vulnerability Detection

Jos\'e Gon\c{c}alves; Tiago Dias; Eva Maia; Isabel Pra\c{c}a

arXiv:2407.14372·cs.SE·July 22, 2024

SCoPE: Evaluating LLMs for Software Vulnerability Detection

Jos\'e Gon\c{c}alves, Tiago Dias, Eva Maia, Isabel Pra\c{c}a

PDF

Open Access

TL;DR

This paper introduces SCoPE, a framework for processing C/C++ code to improve vulnerability detection using LLMs, refining datasets and evaluating model performance with promising results.

Contribution

SCoPE provides a novel code normalization framework that enhances dataset quality and supports effective vulnerability detection with fine-tuned LLMs.

Findings

01

SCoPE identified 905 duplicate functions in the dataset.

02

The best LLM achieved a 53% F1-score in vulnerability detection.

03

Refined datasets improve model training and evaluation.

Abstract

In recent years, code security has become increasingly important, especially with the rise of interconnected technologies. Detecting vulnerabilities early in the software development process has demonstrated numerous benefits. Consequently, the scientific community started using machine learning for automated detection of source code vulnerabilities. This work explores and refines the CVEFixes dataset, which is commonly used to train models for code-related tasks, specifically the C/C++ subset. To this purpose, the Source Code Processing Engine (SCoPE), a framework composed of strategized techniques that can be used to reduce the size and normalize C/C++ functions is presented. The output generated by SCoPE was used to create a new version of CVEFixes. This refined dataset was then employed in a feature representation analysis to assess the effectiveness of the tool's code processing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Reliability and Analysis Research · Web Application Security Vulnerabilities · Software System Performance and Reliability