CyberLLM-FINDS 2025: Instruction-Tuned Fine-tuning of Domain-Specific LLMs with Retrieval-Augmented Generation and Graph Integration for MITRE Evaluation
Vasanth Iyer, Leonardo Bobadilla, S. S. Iyengar

TL;DR
This paper presents a methodology for fine-tuning domain-specific cybersecurity LLMs using retrieval-augmented generation and graph reasoning, improving threat detection and analysis capabilities within prompt length constraints.
Contribution
It introduces a hybrid fine-tuning and data generation approach, along with a RAG pipeline and graph modules, to enhance LLM performance in cybersecurity tasks and MITRE ATT&CK alignment.
Findings
Hybrid approach improves model deployment efficiency.
Graph modules enhance multi-hop reasoning and TTP coverage.
Retrieval-augmented generation increases recall in threat scenarios.
Abstract
Large Language Models (LLMs) such as Gemma-2B have shown strong performance in various natural language processing tasks. However, general-purpose models often lack the domain expertise required for cybersecurity applications. This work presents a methodology to fine-tune the Gemma-2B model into a domain-specific cybersecurity LLM. We detail the processes of dataset preparation, fine-tuning, and synthetic data generation, along with implications for real-world applications in threat detection, forensic investigation, and attack analysis. Experiments highlight challenges in prompt length distribution during domain-specific fine-tuning. Uneven prompt lengths limit the model's effective use of the context window, constraining local inference to 200-400 tokens despite hardware support for longer sequences. Chain-of-thought styled prompts, paired with quantized weights, yielded the best…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Natural Language Processing Techniques
