Crimson: Empowering Strategic Reasoning in Cybersecurity through Large   Language Models

Jiandong Jin; Bowen Tang; Mingxuan Ma; Xiao Liu; Yunfei Wang; Qingnan; Lai; Jia Yang; Changling Zhou

arXiv:2403.00878·cs.CR·March 5, 2024·1 cites

Crimson: Empowering Strategic Reasoning in Cybersecurity through Large Language Models

Jiandong Jin, Bowen Tang, Mingxuan Ma, Xiao Liu, Yunfei Wang, Qingnan, Lai, Jia Yang, Changling Zhou

PDF

Open Access

TL;DR

Crimson enhances large language models' strategic reasoning in cybersecurity by integrating CVE data with ATT&CK techniques, employing novel training methods, and demonstrating improved performance and reduced errors in threat analysis.

Contribution

The paper introduces Crimson, a system that combines data synthesis, retrieval-aware training, and domain-specific fine-tuning to significantly improve LLMs' cybersecurity reasoning capabilities.

Findings

01

LLMs fine-tuned with Crimson approach GPT-4 performance.

02

Reduced hallucination and errors in LLMs after applying Crimson techniques.

03

Domain-specific embedding models enhance cybersecurity task performance.

Abstract

We introduces Crimson, a system that enhances the strategic reasoning capabilities of Large Language Models (LLMs) within the realm of cybersecurity. By correlating CVEs with MITRE ATT&CK techniques, Crimson advances threat anticipation and strategic defense efforts. Our approach includes defining and evaluating cybersecurity strategic tasks, alongside implementing a comprehensive human-in-the-loop data-synthetic workflow to develop the CVE-to-ATT&CK Mapping (CVEM) dataset. We further enhance LLMs' reasoning abilities through a novel Retrieval-Aware Training (RAT) process and its refined iteration, RAT-R. Our findings demonstrate that an LLM fine-tuned with our techniques, possessing 7 billion parameters, approaches the performance level of GPT-4, showing markedly lower rates of hallucination and errors, and surpassing other models in strategic reasoning tasks. Moreover,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInformation and Cyber Security · Terrorism, Counterterrorism, and Political Violence · Topic Modeling

MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Multi-Head Attention · Layer Normalization · Dropout · Softmax · Dense Connections · Label Smoothing · Adam