From Prompt Injections to Protocol Exploits: Threats in LLM-Powered AI Agents Workflows

Mohamed Amine Ferrag; Norbert Tihanyi; Djallel Hamouda; Leandros Maglaras; Abderrahmane Lakas; Merouane Debbah

arXiv:2506.23260·cs.CR·December 16, 2025

From Prompt Injections to Protocol Exploits: Threats in LLM-Powered AI Agents Workflows

Mohamed Amine Ferrag, Norbert Tihanyi, Djallel Hamouda, Leandros Maglaras, Abderrahmane Lakas, Merouane Debbah

PDF

TL;DR

This paper presents a comprehensive threat model for LLM-powered AI agents, categorizing over thirty attack techniques and proposing mitigation strategies to enhance security in complex, multi-agent workflows.

Contribution

It introduces the first integrated taxonomy linking input exploits and protocol vulnerabilities in LLM-agent ecosystems, with formal threat formulations and mitigation guidance.

Findings

01

Identified over thirty attack techniques across different system layers.

02

Validated the threat model through expert review and real-world incident mapping.

03

Reviewed existing defenses and proposed mitigation strategies.

Abstract

Autonomous AI agents powered by large language models (LLMs) with structured function-calling interfaces enable real-time data retrieval, computation, and multi-step orchestration. However, the rapid growth of plugins, connectors, and inter-agent protocols has outpaced security practices, leading to brittle integrations that rely on ad-hoc authentication, inconsistent schemas, and weak validation. This survey introduces a unified end-to-end threat model for LLM-agent ecosystems, covering host-to-tool and agent-to-agent communications. We systematically categorize more than thirty attack techniques spanning input manipulation, model compromise, system and privacy attacks, and protocol-level vulnerabilities. For each category, we provide a formal threat formulation defining attacker capabilities, objectives, and affected system layers. Representative examples include Prompt-to-SQL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.