Interpretable Ransomware Detection Using Hybrid Large Language Models: A Comparative Analysis of BERT, RoBERTa, and DeBERTa Through LIME and SHAP
Elodie Mutombo Ngoie, Mike Nkongolo Wa Nkongolo, Peace Azugo, and Mahmut Tokmak

TL;DR
This study compares BERT, RoBERTa, and DeBERTa large language models for ransomware detection, transforming system features into text and applying explainable AI techniques to interpret their decision-making processes.
Contribution
It introduces a novel approach of converting cybersecurity features into text for LLMs and evaluates their interpretability using LIME and SHAP in ransomware detection.
Findings
RoBERTa achieved the highest F1-score.
BERT relied heavily on file-operation features.
DeBERTa was sensitive to financial and network indicators.
Abstract
Ransomware continues to evolve in complexity, making early and explainable detection a critical requirement for modern cybersecurity systems. This study presents a comparative analysis of three Transformer-based Large Language Models (LLMs) (BERT, RoBERTa, and DeBERTa) for ransomware detection using two structured datasets: UGRansome and Process Memory (PM). Since LLMs are primarily designed for natural language processing (NLP), numerical and categorical ransomware features were transformed into textual sequences using KBinsDiscretizer and token-based encoding. This enabled the models to learn behavioural patterns from system activity and network traffic through contextual embeddings. The models were fine-tuned on approximately 2,500 labelled samples and evaluated using accuracy, F1 score, and ROC-AUC. To ensure transparent decision-making in this high-stakes domain, two explainable AI…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
