Position: Safety and Fairness in Agentic AI Depend on Interaction Topology, Not on Model Scale or Alignment

Tanav Singh Bajaj; Nikhil Singh; Karan Anand; Eishkaran Singh

arXiv:2605.01147·cs.AI·May 5, 2026

Position: Safety and Fairness in Agentic AI Depend on Interaction Topology, Not on Model Scale or Alignment

Tanav Singh Bajaj, Nikhil Singh, Karan Anand, Eishkaran Singh

PDF

TL;DR

This paper argues that safety and fairness in agentic AI depend on interaction topology rather than model scale or alignment, highlighting topology-driven pathologies like ordering instability and information cascades.

Contribution

It challenges the assumption that safety properties compose from individual models, emphasizing the importance of interaction structure in multi-agent AI systems.

Findings

01

Interaction topology dominates safety outcomes in agentic AI.

02

Scaling models increases consensus and exacerbates topology-driven failures.

03

Traditional model-centric evaluation misses critical safety failure modes.

Abstract

As large language models are increasingly deployed as interacting agents in high-stakes decisions, the AI safety community assumes that safety properties of individual models will compose into safe multi-agent behavior. This position paper argues that this assumption is fundamentally mistaken. In agentic AI, safety is determined by interaction topology, not model weights. When agents deliberate sequentially or aggregate via parallel voting with a judge, the structure of information flow and decision coupling dominates outcomes. Evidence across model families and scales reveals three persistent topology-driven pathologies: ordering instability, where system behavior depends primarily on agent sequence; information cascades, where early judgments propagate regardless of correctness; and functional collapse, where systems satisfy fairness metrics while abandoning meaningful risk…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.