Architecture Matters for Multi-Agent Security
Ben Hagag, William L. Anderson, Christian Schroeder de Witt, Sarah Scheffler

TL;DR
This paper empirically examines how multi-agent system architecture choices impact the balance between task performance and security vulnerabilities across various environments and configurations.
Contribution
It systematically analyzes the influence of agent roles, communication topology, and memory on security and performance tradeoffs in multi-agent systems.
Findings
Multi-agent architectures are generally more vulnerable than standalone agents.
Attack success rates can be up to 3.8 times higher in certain configurations.
No single architectural design is universally more secure.
Abstract
Multi-agent systems (MAS), composed of networks of two or more autonomous AI agents, have become increasingly popular in production deployments, yet introduce security risks that do not arise in single-agent settings. Even if individual agents exhibit robust security, architectural decisions governing their coordination can create attack surfaces that have not been systematically characterized. In this work, we present an empirical study of how MAS design decisions shape the tradeoff between task performance and attack resistance. Across three agentic environments (browser, desktop, and code) and 13 architectural configurations, we use stagewise evaluations that distinguish planning refusal, execution-stage interception, partial harmful execution, and successful attack completion to study three key design choices: (i) agent roles, which determine how authority and responsibility are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
