Towards Unifying Quantitative Security Benchmarking for Multi Agent Systems

Gauri Sharma; Vidhi Kulkarni; Miles King; Ken Huang

arXiv:2507.21146·cs.CR·July 30, 2025

Towards Unifying Quantitative Security Benchmarking for Multi Agent Systems

Gauri Sharma, Vidhi Kulkarni, Miles King, Ken Huang

PDF

TL;DR

This paper introduces a formal attack vector called Agent Cascading Injection, analyzes its properties in multi-agent systems, and emphasizes the need for quantitative benchmarking to evaluate and improve security against cascading trust failures.

Contribution

It formalizes the ACI attack, analyzes its propagation and impact, and proposes a framework for quantitative security benchmarking in multi-agent systems.

Findings

01

ACI can cause cascading compromises across agents.

02

Propagation chains and amplification factors are key to understanding ACI.

03

A benchmarking methodology for multi-agent security is outlined.

Abstract

Evolving AI systems increasingly deploy multi-agent architectures where autonomous agents collaborate, share information, and delegate tasks through developing protocols. This connectivity, while powerful, introduces novel security risks. One such risk is a cascading risk: a breach in one agent can cascade through the system, compromising others by exploiting inter-agent trust. In tandem with OWASP's initiative for an Agentic AI Vulnerability Scoring System we define an attack vector, Agent Cascading Injection, analogous to Agent Impact Chain and Blast Radius, operating across networks of agents. In an ACI attack, a malicious input or tool exploit injected at one agent leads to cascading compromises and amplified downstream effects across agents that trust its outputs. We formalize this attack with an adversarial goal equation and key variables (compromised agent, injected exploit,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.