Interpretable Ransomware Detection Using Hybrid Large Language Models: A Comparative Analysis of BERT, RoBERTa, and DeBERTa Through LIME and SHAP

Elodie Mutombo Ngoie; Mike Nkongolo Wa Nkongolo; Peace Azugo; and Mahmut Tokmak

arXiv:2511.13517·cs.CR·January 21, 2026

Interpretable Ransomware Detection Using Hybrid Large Language Models: A Comparative Analysis of BERT, RoBERTa, and DeBERTa Through LIME and SHAP

Elodie Mutombo Ngoie, Mike Nkongolo Wa Nkongolo, Peace Azugo, and Mahmut Tokmak

PDF

TL;DR

This study compares BERT, RoBERTa, and DeBERTa large language models for ransomware detection, transforming system features into text and applying explainable AI techniques to interpret their decision-making processes.

Contribution

It introduces a novel approach of converting cybersecurity features into text for LLMs and evaluates their interpretability using LIME and SHAP in ransomware detection.

Findings

01

RoBERTa achieved the highest F1-score.

02

BERT relied heavily on file-operation features.

03

DeBERTa was sensitive to financial and network indicators.

Abstract

Ransomware continues to evolve in complexity, making early and explainable detection a critical requirement for modern cybersecurity systems. This study presents a comparative analysis of three Transformer-based Large Language Models (LLMs) (BERT, RoBERTa, and DeBERTa) for ransomware detection using two structured datasets: UGRansome and Process Memory (PM). Since LLMs are primarily designed for natural language processing (NLP), numerical and categorical ransomware features were transformed into textual sequences using KBinsDiscretizer and token-based encoding. This enabled the models to learn behavioural patterns from system activity and network traffic through contextual embeddings. The models were fine-tuned on approximately 2,500 labelled samples and evaluated using accuracy, F1 score, and ROC-AUC. To ensure transparent decision-making in this high-stakes domain, two explainable AI…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.