Beyond Detection: Leveraging Large Language Models for Cyber Attack   Prediction in IoT Networks

Alaeddine Diaf; Abdelaziz Amara Korba; Nour Elislem Karabadji; and; Yacine Ghamri-Doudane

arXiv:2408.14045·cs.CR·August 27, 2024

Beyond Detection: Leveraging Large Language Models for Cyber Attack Prediction in IoT Networks

Alaeddine Diaf, Abdelaziz Amara Korba, Nour Elislem Karabadji, and, Yacine Ghamri-Doudane

PDF

Open Access

TL;DR

This paper introduces a proactive IoT attack prediction framework that combines large language models with LSTM networks, significantly improving early detection accuracy of cyber threats in IoT networks.

Contribution

It presents a novel integration of LLMs and LSTM for proactive attack prediction, advancing beyond reactive detection methods in IoT cybersecurity.

Findings

01

Achieved 98% overall prediction accuracy on CICIoT2023 dataset.

02

Demonstrated improved early detection capabilities over traditional methods.

03

Validated the effectiveness of combining LLMs with LSTM in cybersecurity.

Abstract

In recent years, numerous large-scale cyberattacks have exploited Internet of Things (IoT) devices, a phenomenon that is expected to escalate with the continuing proliferation of IoT technology. Despite considerable efforts in attack detection, intrusion detection systems remain mostly reactive, responding to specific patterns or observed anomalies. This work proposes a proactive approach to anticipate and mitigate malicious activities before they cause damage. This paper proposes a novel network intrusion prediction framework that combines Large Language Models (LLMs) with Long Short Term Memory (LSTM) networks. The framework incorporates two LLMs in a feedback loop: a fine-tuned Generative Pre-trained Transformer (GPT) model for predicting network traffic and a fine-tuned Bidirectional Encoder Representations from Transformers (BERT) for evaluating the predicted traffic. The LSTM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNetwork Security and Intrusion Detection · Advanced Malware Detection Techniques · Misinformation and Its Impacts

MethodsLinear Layer · Adam · Layer Normalization · Attention Is All You Need · Position-Wise Feed-Forward Layer · Dense Connections · Tanh Activation · Residual Connection · Multi-Head Attention · Byte Pair Encoding