Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents

Christian Schroeder de Witt; Klaudia Krawiecka; Igor Krawczuk; Ben Hagag; William L. Anderson; Peter Belcak; Ben Bucknall; Xiaohong Cai; Ayush Chopra; Doron Cohen; Ron F. Del Rosario; Andis Draguns; Annie Gray; Keren Katz; Vasilios Mavroudis; Jaron Mink; Sumeet Ramesh Motwani; Jonathan Petit; Leif-Sebastian Rembeck; Chandler Smith; John Sotiropoulos; Steven Young; Sarah Scheffler; Mary Llewellyn

arXiv:2505.02077·cs.CR·April 30, 2026·2 cites

Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents

Christian Schroeder de Witt, Klaudia Krawiecka, Igor Krawczuk, Ben Hagag, William L. Anderson, Peter Belcak, Ben Bucknall, Xiaohong Cai, Ayush Chopra, Doron Cohen, Ron F. Del Rosario, Andis Draguns, Annie Gray, Keren Katz, Vasilios Mavroudis, Jaron Mink, Sumeet Ramesh Motwani

PDF

TL;DR

This paper introduces the emerging field of multi-agent security, addressing the unique threats and challenges posed by interacting AI agents across various environments, and proposes a research agenda to enhance system security.

Contribution

It formalizes the threat landscape, offers applications across subfields, and outlines a unified research agenda for securing multi-agent systems.

Findings

01

Taxonomizes threats from interacting AI agents

02

Provides applications for multi-agent security

03

Proposes a research agenda for open challenges

Abstract

AI agents are beginning to interact with each other directly and across internet platforms and physical environments, creating security challenges beyond traditional cybersecurity and AI safety frameworks. Free-form protocols are essential for AI's task generalization but enable new threats like secret collusion and coordinated swarm attacks. Network effects can rapidly spread privacy breaches, disinformation, jailbreaks, and data poisoning, while multi-agent dispersion and stealth optimization help adversaries evade oversight - creating novel persistent threats at a systemic level. Despite their critical importance, these security challenges remain understudied, with research fragmented across disparate fields including AI security, multi-agent learning, complex systems, cybersecurity, game theory, distributed systems, and technical AI governance. We introduce multi-agent security, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.