RAG-targeted Adversarial Attack on LLM-based Threat Detection and Mitigation Framework
Seif Ikbarieh, Kshitiz Aryal, Maanak Gupta

TL;DR
This paper demonstrates how targeted data poisoning attacks can significantly weaken the effectiveness of LLM-based IoT threat detection systems by corrupting their knowledge base, exposing vulnerabilities in AI-driven cybersecurity frameworks.
Contribution
It introduces a novel adversarial attack method on RAG-based LLM threat detection, highlighting vulnerabilities and providing insights into improving robustness against data poisoning.
Findings
Small perturbations degrade model performance
Attack weakens link between traffic features and attack behavior
Mitigation recommendations become less specific and practical
Abstract
The rapid expansion of the Internet of Things (IoT) is reshaping communication and operational practices across industries, but it also broadens the attack surface and increases susceptibility to security breaches. Artificial Intelligence has become a valuable solution in securing IoT networks, with Large Language Models (LLMs) enabling automated attack behavior analysis and mitigation suggestion in Network Intrusion Detection Systems (NIDS). Despite advancements, the use of LLMs in such systems further expands the attack surface, putting entire networks at risk by introducing vulnerabilities such as prompt injection and data poisoning. In this work, we attack an LLM-based IoT attack analysis and mitigation framework to test its adversarial robustness. We construct an attack description dataset and use it in a targeted data poisoning attack that applies word-level, meaning-preserving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Network Security and Intrusion Detection · Software-Defined Networks and 5G
