CyberLLM-FINDS 2025: Instruction-Tuned Fine-tuning of Domain-Specific LLMs with Retrieval-Augmented Generation and Graph Integration for MITRE Evaluation

Vasanth Iyer; Leonardo Bobadilla; S. S. Iyengar

arXiv:2601.06779·cs.CR·January 13, 2026

CyberLLM-FINDS 2025: Instruction-Tuned Fine-tuning of Domain-Specific LLMs with Retrieval-Augmented Generation and Graph Integration for MITRE Evaluation

Vasanth Iyer, Leonardo Bobadilla, S. S. Iyengar

PDF

Open Access

TL;DR

This paper presents a methodology for fine-tuning domain-specific cybersecurity LLMs using retrieval-augmented generation and graph reasoning, improving threat detection and analysis capabilities within prompt length constraints.

Contribution

It introduces a hybrid fine-tuning and data generation approach, along with a RAG pipeline and graph modules, to enhance LLM performance in cybersecurity tasks and MITRE ATT&CK alignment.

Findings

01

Hybrid approach improves model deployment efficiency.

02

Graph modules enhance multi-hop reasoning and TTP coverage.

03

Retrieval-augmented generation increases recall in threat scenarios.

Abstract

Large Language Models (LLMs) such as Gemma-2B have shown strong performance in various natural language processing tasks. However, general-purpose models often lack the domain expertise required for cybersecurity applications. This work presents a methodology to fine-tune the Gemma-2B model into a domain-specific cybersecurity LLM. We detail the processes of dataset preparation, fine-tuning, and synthetic data generation, along with implications for real-world applications in threat detection, forensic investigation, and attack analysis. Experiments highlight challenges in prompt length distribution during domain-specific fine-tuning. Uneven prompt lengths limit the model's effective use of the context window, constraining local inference to 200-400 tokens despite hardware support for longer sequences. Chain-of-thought styled prompts, paired with quantized weights, yielded the best…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Topic Modeling · Natural Language Processing Techniques