Toward a Safe Internet of Agents

Juan A. Wibowo; George C. Polyzos

arXiv:2512.00520 · cs.MA · April 28, 2026

TL;DR

This paper presents a comprehensive framework for designing safe and secure autonomous AI agents within interconnected systems, emphasizing architectural vulnerabilities and mitigation strategies.

Contribution

It introduces a layered analysis of agentic systems, proposing principles for co-designing safety with capability in the Internet of Agents.

Findings

01. Identifies vulnerabilities at the single-agent, multi-agent, and ecosystem levels.

02. Proposes four pillars for secure interoperable multi-agent systems.

03. Highlights the importance of co-designing safety with capability.

Abstract

Autonomous Artificial Intelligence (AI) agents, powered by Large Language Models (LLMs), advance rapidly toward interconnected systems -- an Internet of Agents (IoA). This vision enables complex problem-solving while introducing systemic safety and security risks. Beyond existing threat taxonomies, we provide a principled guide addressing architectural vulnerability sources. We offer a framework for engineering safe agentic systems through bottom-up deconstruction, analyzing each component as a dual-use interface where capability expansion creates attack surface growth. We examine three tiers: (1) Single Agents -- analyzing inherent risks in models, memory, design patterns, tools, and guardrails; (2) Multi-Agent Systems (MAS) -- examining collective behavior components including architectural patterns, communication mechanisms, verification, and system guardrails; and (3) Interoperable…
