Revolutionizing Cyber Threat Detection with Large Language Models: A   privacy-preserving BERT-based Lightweight Model for IoT/IIoT Devices

Mohamed Amine Ferrag; Mthandazo Ndhlovu; Norbert Tihanyi; Lucas C.; Cordeiro; Merouane Debbah; Thierry Lestable; Narinderjit Singh Thandi

arXiv:2306.14263·cs.CR·February 9, 2024·23 cites

Revolutionizing Cyber Threat Detection with Large Language Models: A privacy-preserving BERT-based Lightweight Model for IoT/IIoT Devices

Mohamed Amine Ferrag, Mthandazo Ndhlovu, Norbert Tihanyi, Lucas C., Cordeiro, Merouane Debbah, Thierry Lestable, Narinderjit Singh Thandi

PDF

Open Access

TL;DR

This paper introduces SecurityBERT, a privacy-preserving, lightweight BERT-based model for IoT cyber threat detection, achieving high accuracy and efficiency suitable for resource-constrained devices.

Contribution

The paper presents a novel privacy-preserving encoding technique and a lightweight BERT-based architecture tailored for IoT cybersecurity, outperforming existing methods.

Findings

01

SecurityBERT achieved 98.2% accuracy in attack detection.

02

The model has a size of 16.7MB and inference time under 0.15 seconds.

03

Outperforms traditional ML and DL approaches in IoT threat detection.

Abstract

The field of Natural Language Processing (NLP) is currently undergoing a revolutionary transformation driven by the power of pre-trained Large Language Models (LLMs) based on groundbreaking Transformer architectures. As the frequency and diversity of cybersecurity attacks continue to rise, the importance of incident detection has significantly increased. IoT devices are expanding rapidly, resulting in a growing need for efficient techniques to autonomously identify network-based attacks in IoT networks with both high precision and minimal computational requirements. This paper presents SecurityBERT, a novel architecture that leverages the Bidirectional Encoder Representations from Transformers (BERT) model for cyber threat detection in IoT networks. During the training of SecurityBERT, we incorporated a novel privacy-preserving encoding technique called Privacy-Preserving Fixed-Length…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNetwork Security and Intrusion Detection · Advanced Malware Detection Techniques · Digital and Cyber Forensics

MethodsPosition-Wise Feed-Forward Layer · Label Smoothing · Absolute Position Encodings · Byte Pair Encoding · Transformer · Multi-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Attention Dropout