Insights and Current Gaps in Open-Source LLM Vulnerability Scanners: A   Comparative Analysis

Jonathan Brokman; Omer Hofman; Oren Rachmil; Inderjeet Singh; Vikas; Pahuja; Rathina Sabapathy Aishvariya Priya; Amit Giloni; Roman Vainshtein,; Hisashi Kojima

arXiv:2410.16527·cs.CR·November 19, 2024·2 cites

Insights and Current Gaps in Open-Source LLM Vulnerability Scanners: A Comparative Analysis

Jonathan Brokman, Omer Hofman, Oren Rachmil, Inderjeet Singh, Vikas, Pahuja, Rathina Sabapathy Aishvariya Priya, Amit Giloni, Roman Vainshtein,, Hisashi Kojima

PDF

Open Access

TL;DR

This paper compares open-source vulnerability scanners for conversational LLMs, highlighting their features, limitations, and reliability issues, and introduces a labelled dataset to improve future vulnerability detection.

Contribution

It provides a comprehensive comparison of prominent LLM vulnerability scanners, identifies reliability gaps, and offers a labelled dataset to aid future research and development.

Findings

01

Significant reliability issues in current scanners

02

Unifying principles of scanner design identified

03

A preliminary labelled dataset introduced

Abstract

This report presents a comparative analysis of open-source vulnerability scanners for conversational large language models (LLMs). As LLMs become integral to various applications, they also present potential attack surfaces, exposed to security risks such as information leakage and jailbreak attacks. Our study evaluates prominent scanners - Garak, Giskard, PyRIT, and CyberSecEval - that adapt red-teaming practices to expose these vulnerabilities. We detail the distinctive features and practical use of these scanners, outline unifying principles of their design and perform quantitative evaluations to compare them. These evaluations uncover significant reliability issues in detecting successful attacks, highlighting a fundamental gap for future development. Additionally, we contribute a preliminary labelled dataset, which serves as an initial step to bridge this gap. Based on the above,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsWeb Application Security Vulnerabilities