TL;DR
This paper introduces an information-theoretic framework to detect and analyze higher-order emergent structures in multi-agent language models, demonstrating how prompt design can steer collective behavior.
Contribution
It develops a data-driven method using partial information decomposition to identify and localize emergence in multi-agent LLM systems, distinguishing genuine synergy from spurious correlations.
Findings
Groups with personas show identity-linked differentiation and goal-directed complementarity.
The framework is robust across different measures and not explained by simple coordination or dynamics.
Prompt design can steer systems from mere aggregation to higher-order collective behavior.
Abstract
When are multi-agent LLM systems merely a collection of individual agents versus an integrated collective with higher-order structure? We introduce an information-theoretic framework to test -- in a purely data-driven way -- whether multi-agent systems show signs of higher-order structure. This information decomposition lets us measure whether dynamical emergence is present in multi-agent LLM systems, localize it, and distinguish spurious temporal coupling from performance-relevant cross-agent synergy. We implement a practical criterion and an emergence capacity criterion operationalized as partial information decomposition of time-delayed mutual information (TDMI). We apply our framework to experiments using a simple guessing game without direct agent communication and minimal group-level feedback with three randomized interventions. Groups in the control condition exhibit strong…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
