Attributing Emergence in Million-Agent Systems
Ling Tang, Jilin Mei, Qian Chen, Qihan Ren, Linfeng Zhang, Quanshi Zhang, Jing Shao, Xia Hu, Dongrui Liu

TL;DR
This paper introduces a scalable attribution method for large multi-agent systems powered by LLMs, enabling analysis of macro emergence at million-agent scale, and demonstrates its importance through empirical and theoretical results.
Contribution
It adapts Aumann--Shapley attribution to million-agent systems, achieving significant computational speedup and revealing fundamental scale-dependent differences in attribution.
Findings
Full-scale attribution shows the long tail and middle tier dominate, unlike small-scale studies.
Small panels attribute most influence to high-follower accounts, misrepresenting macro effects.
An Attribution Scaling Bias theorem proves small-scale attributions cannot match full-scale results.
Abstract
Large language models (LLMs) can simulate human-like reasoning and decision-making in individual agents. LLM-powered multi-agent systems (MAS) combine such agents to simulate population-scale social phenomena such as polarization, information cascades, and market panics. Such studies require attributing macro emergence to individual agents, but existing axiomatic methods scale combinatorially in and have been confined to , while the phenomena they explain occur at . We address this gap by adapting Aumann--Shapley path-integral attribution to LLM-powered MAS at million-agent scale; the resulting method satisfies all four axioms, runs four to five orders of magnitude faster than sampled Shapley on the same hardware. We use this method to test the scale gap empirically: across 14 days of public Bluesky data ( active users), we compute the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
