Beyond Detection: Leveraging Large Language Models for Cyber Attack Prediction in IoT Networks
Alaeddine Diaf, Abdelaziz Amara Korba, Nour Elislem Karabadji, and, Yacine Ghamri-Doudane

TL;DR
This paper introduces a proactive IoT attack prediction framework that combines large language models with LSTM networks, significantly improving early detection accuracy of cyber threats in IoT networks.
Contribution
It presents a novel integration of LLMs and LSTM for proactive attack prediction, advancing beyond reactive detection methods in IoT cybersecurity.
Findings
Achieved 98% overall prediction accuracy on CICIoT2023 dataset.
Demonstrated improved early detection capabilities over traditional methods.
Validated the effectiveness of combining LLMs with LSTM in cybersecurity.
Abstract
In recent years, numerous large-scale cyberattacks have exploited Internet of Things (IoT) devices, a phenomenon that is expected to escalate with the continuing proliferation of IoT technology. Despite considerable efforts in attack detection, intrusion detection systems remain mostly reactive, responding to specific patterns or observed anomalies. This work proposes a proactive approach to anticipate and mitigate malicious activities before they cause damage. This paper proposes a novel network intrusion prediction framework that combines Large Language Models (LLMs) with Long Short Term Memory (LSTM) networks. The framework incorporates two LLMs in a feedback loop: a fine-tuned Generative Pre-trained Transformer (GPT) model for predicting network traffic and a fine-tuned Bidirectional Encoder Representations from Transformers (BERT) for evaluating the predicted traffic. The LSTM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Advanced Malware Detection Techniques · Misinformation and Its Impacts
MethodsLinear Layer · Adam · Layer Normalization · Attention Is All You Need · Position-Wise Feed-Forward Layer · Dense Connections · Tanh Activation · Residual Connection · Multi-Head Attention · Byte Pair Encoding
