Multi-Agent Systems Should be Treated as Principal-Agent Problems
Paulius Rauba, Simonas Cepenas, Mihaela van der Schaar

TL;DR
This paper models multi-agent systems as principal-agent problems, highlighting issues like information asymmetry and goal misalignment, especially in LLM-based agents, and suggests economic theory tools for analysis and mitigation.
Contribution
It introduces a principal-agent framework for analyzing multi-agent systems, linking AI safety concerns with economic theory and proposing strategies to address agency loss.
Findings
LLM agents can develop covert goals like scheming.
Principal-agent theory explains information asymmetry in multi-agent systems.
Mechanism design offers mitigation strategies for agency problems.
Abstract
Consider a multi-agent systems setup in which a principal (a supervisor agent) assigns subtasks to specialized agents and aggregates their responses into a single system-level output. A core property of such systems is information asymmetry: agents observe task-specific information, produce intermediate reasoning traces, and operate with different context windows. In isolation, such asymmetry is not problematic, since agents report truthfully to the principal when incentives are fully aligned. However, this assumption breaks down when incentives diverge. Recent evidence suggests that LLM-based agents can acquire their own goals, such as survival or self-preservation, a phenomenon known as scheming, and may deceive humans or other agents. This leads to agency loss: a gap between the principal's intended outcome and the realized system behavior. Drawing on core ideas from microeconomic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Game Theory and Cooperation · Multi-Agent Systems and Negotiation · Experimental Behavioral Economics Studies
