Position: Safety and Fairness in Agentic AI Depend on Interaction Topology, Not on Model Scale or Alignment
Tanav Singh Bajaj, Nikhil Singh, Karan Anand, Eishkaran Singh

TL;DR
This paper argues that safety and fairness in agentic AI depend on interaction topology rather than model scale or alignment, highlighting topology-driven pathologies like ordering instability and information cascades.
Contribution
It challenges the assumption that safety properties compose from individual models, emphasizing the importance of interaction structure in multi-agent AI systems.
Findings
Interaction topology dominates safety outcomes in agentic AI.
Scaling models increases consensus and exacerbates topology-driven failures.
Traditional model-centric evaluation misses critical safety failure modes.
Abstract
As large language models are increasingly deployed as interacting agents in high-stakes decisions, the AI safety community assumes that safety properties of individual models will compose into safe multi-agent behavior. This position paper argues that this assumption is fundamentally mistaken. In agentic AI, safety is determined by interaction topology, not model weights. When agents deliberate sequentially or aggregate via parallel voting with a judge, the structure of information flow and decision coupling dominates outcomes. Evidence across model families and scales reveals three persistent topology-driven pathologies: ordering instability, where system behavior depends primarily on agent sequence; information cascades, where early judgments propagate regardless of correctness; and functional collapse, where systems satisfy fairness metrics while abandoning meaningful risk…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
