CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for   Benchmarking Large Language Models

Johan Wahr\'eus; Ahmed Mohamed Hussain; and Panos Papadimitratos

arXiv:2501.01335·cs.CR·January 3, 2025

CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models

Johan Wahr\'eus, Ahmed Mohamed Hussain, and Panos Papadimitratos

PDF

Open Access 1 Repo

TL;DR

This paper introduces CySecBench, a large, domain-specific prompt dataset for evaluating LLM jailbreaking in cybersecurity, and demonstrates a prompt obfuscation method that effectively bypasses security measures in commercial models.

Contribution

The paper presents CySecBench, a novel cybersecurity-focused prompt dataset with a systematic generation methodology, and evaluates a new prompt obfuscation approach for jailbreaking LLMs.

Findings

01

Prompt obfuscation achieves 65% success on ChatGPT

02

88% success rate on Gemini, 17% on Claude

03

Outperforms existing methods on AdvBench dataset

Abstract

Numerous studies have investigated methods for jailbreaking Large Language Models (LLMs) to generate harmful content. Typically, these methods are evaluated using datasets of malicious prompts designed to bypass security policies established by LLM providers. However, the generally broad scope and open-ended nature of existing datasets can complicate the assessment of jailbreaking effectiveness, particularly in specific domains, notably cybersecurity. To address this issue, we present and publicly release CySecBench, a comprehensive dataset containing 12662 prompts specifically designed to evaluate jailbreaking techniques in the cybersecurity domain. The dataset is organized into 10 distinct attack-type categories, featuring close-ended prompts to enable a more consistent and accurate assessment of jailbreaking attempts. Furthermore, we detail our methodology for dataset generation and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cysecbench/dataset
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling