Towards Unifying Quantitative Security Benchmarking for Multi Agent Systems
Gauri Sharma, Vidhi Kulkarni, Miles King, Ken Huang

TL;DR
This paper introduces a formal attack vector called Agent Cascading Injection, analyzes its properties in multi-agent systems, and emphasizes the need for quantitative benchmarking to evaluate and improve security against cascading trust failures.
Contribution
It formalizes the ACI attack, analyzes its propagation and impact, and proposes a framework for quantitative security benchmarking in multi-agent systems.
Findings
ACI can cause cascading compromises across agents.
Propagation chains and amplification factors are key to understanding ACI.
A benchmarking methodology for multi-agent security is outlined.
Abstract
Evolving AI systems increasingly deploy multi-agent architectures where autonomous agents collaborate, share information, and delegate tasks through developing protocols. This connectivity, while powerful, introduces novel security risks. One such risk is a cascading risk: a breach in one agent can cascade through the system, compromising others by exploiting inter-agent trust. In tandem with OWASP's initiative for an Agentic AI Vulnerability Scoring System we define an attack vector, Agent Cascading Injection, analogous to Agent Impact Chain and Blast Radius, operating across networks of agents. In an ACI attack, a malicious input or tool exploit injected at one agent leads to cascading compromises and amplified downstream effects across agents that trust its outputs. We formalize this attack with an adversarial goal equation and key variables (compromised agent, injected exploit,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
