Revolutionizing Cyber Threat Detection with Large Language Models: A privacy-preserving BERT-based Lightweight Model for IoT/IIoT Devices
Mohamed Amine Ferrag, Mthandazo Ndhlovu, Norbert Tihanyi, Lucas C., Cordeiro, Merouane Debbah, Thierry Lestable, Narinderjit Singh Thandi

TL;DR
This paper introduces SecurityBERT, a privacy-preserving, lightweight BERT-based model for IoT cyber threat detection, achieving high accuracy and efficiency suitable for resource-constrained devices.
Contribution
The paper presents a novel privacy-preserving encoding technique and a lightweight BERT-based architecture tailored for IoT cybersecurity, outperforming existing methods.
Findings
SecurityBERT achieved 98.2% accuracy in attack detection.
The model has a size of 16.7MB and inference time under 0.15 seconds.
Outperforms traditional ML and DL approaches in IoT threat detection.
Abstract
The field of Natural Language Processing (NLP) is currently undergoing a revolutionary transformation driven by the power of pre-trained Large Language Models (LLMs) based on groundbreaking Transformer architectures. As the frequency and diversity of cybersecurity attacks continue to rise, the importance of incident detection has significantly increased. IoT devices are expanding rapidly, resulting in a growing need for efficient techniques to autonomously identify network-based attacks in IoT networks with both high precision and minimal computational requirements. This paper presents SecurityBERT, a novel architecture that leverages the Bidirectional Encoder Representations from Transformers (BERT) model for cyber threat detection in IoT networks. During the training of SecurityBERT, we incorporated a novel privacy-preserving encoding technique called Privacy-Preserving Fixed-Length…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Advanced Malware Detection Techniques · Digital and Cyber Forensics
MethodsPosition-Wise Feed-Forward Layer · Label Smoothing · Absolute Position Encodings · Byte Pair Encoding · Transformer · Multi-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Attention Dropout
