Toward a Safe Internet of Agents
Juan A. Wibowo, George C. Polyzos

TL;DR
This paper presents a comprehensive framework for designing safe and secure autonomous AI agents within interconnected systems, emphasizing architectural vulnerabilities and mitigation strategies.
Contribution
It introduces a layered analysis of agentic systems, proposing principles for co-designing safety with capability in the Internet of Agents.
Findings
Identifies vulnerabilities at the single-agent, multi-agent, and ecosystem levels.
Proposes four pillars for secure interoperable multi-agent systems.
Highlights the importance of co-designing safety with capability.
Abstract
Autonomous Artificial Intelligence (AI) agents, powered by Large Language Models (LLMs), are advancing rapidly toward interconnected systems -- an Internet of Agents (IoA). This vision enables complex problem-solving but also introduces systemic safety and security risks. Going beyond existing threat taxonomies, we provide a principled guide to the architectural sources of vulnerability. We offer a framework for engineering safe agentic systems through bottom-up deconstruction, analyzing each component as a dual-use interface in which every expansion of capability also enlarges the attack surface. We examine three tiers: (1) Single Agents -- analyzing inherent risks in models, memory, design patterns, tools, and guardrails; (2) Multi-Agent Systems (MAS) -- examining the components of collective behavior, including architectural patterns, communication mechanisms, verification, and system guardrails; and (3) Interoperable…
