A Ground-Truth-Based Evaluation of Vulnerability Detection Across Multiple Ecosystems

Peter Mandl; Paul Mandl; Martin H\"ausl; Maximilian Auch

arXiv:2604.21111·cs.SE·April 24, 2026

A Ground-Truth-Based Evaluation of Vulnerability Detection Across Multiple Ecosystems

Peter Mandl, Paul Mandl, Martin H\"ausl, Maximilian Auch

PDF

TL;DR

This paper presents an empirical evaluation of vulnerability detection tools across multiple ecosystems using a curated ground-truth dataset from OSV, highlighting differences between systems and emphasizing reproducibility.

Contribution

It introduces a systematic, reproducible methodology and an open-source tool for evaluating vulnerability detection across ecosystems using a curated dataset.

Findings

01

Detection results vary systematically between tools.

02

The curated dataset enables consistent cross-ecosystem comparison.

03

Open-source tool supports dataset reconstruction from OSV.

Abstract

Automated vulnerability detection tools are widely used to identify security vulnerabilities in software dependencies. However, the evaluation of such tools remains challenging due to the heterogeneous structure of vulnerability data sources, inconsistent identifier schemes, and ambiguities in version range specifications. In this paper, we present an empirical evaluation of vulnerability detection across multiple software ecosystems using a curated ground-truth dataset derived from the Open Source Vulnerabilities (OSV) database. The dataset explicitly maps vulnerabilities to concrete package versions and enables a systematic comparison of detection results across different tools and services. Since vulnerability databases such as OSV are continuously updated, the dataset used in this study represents a snapshot of the vulnerability landscape at the time of the evaluation. To support…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.