Attributing Emergence in Million-Agent Systems

Ling Tang; Jilin Mei; Qian Chen; Qihan Ren; Linfeng Zhang; Quanshi Zhang; Jing Shao; Xia Hu; Dongrui Liu

arXiv:2605.11404·cs.AI·May 13, 2026

Attributing Emergence in Million-Agent Systems

Ling Tang, Jilin Mei, Qian Chen, Qihan Ren, Linfeng Zhang, Quanshi Zhang, Jing Shao, Xia Hu, Dongrui Liu

PDF

TL;DR

This paper introduces a scalable attribution method for large multi-agent systems powered by LLMs, enabling analysis of macro emergence at million-agent scale, and demonstrates its importance through empirical and theoretical results.

Contribution

It adapts Aumann--Shapley attribution to million-agent systems, achieving significant computational speedup and revealing fundamental scale-dependent differences in attribution.

Findings

01

Full-scale attribution shows the long tail and middle tier dominate, unlike small-scale studies.

02

Small panels attribute most influence to high-follower accounts, misrepresenting macro effects.

03

An Attribution Scaling Bias theorem proves small-scale attributions cannot match full-scale results.

Abstract

Large language models (LLMs) can simulate human-like reasoning and decision-making in individual agents. LLM-powered multi-agent systems (MAS) combine such agents to simulate population-scale social phenomena such as polarization, information cascades, and market panics. Such studies require attributing macro emergence to individual agents, but existing axiomatic methods scale combinatorially in $N$ and have been confined to $N ≲ 1 0^{3}$ , while the phenomena they explain occur at $N \geq 1 0^{6}$ . We address this gap by adapting Aumann--Shapley path-integral attribution to LLM-powered MAS at million-agent scale; the resulting method satisfies all four axioms, runs four to five orders of magnitude faster than sampled Shapley on the same hardware. We use this method to test the scale gap empirically: across 14 days of public Bluesky data ( $1, 671, 587$ active users), we compute the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.