Socially-Weighted Alignment: A Game-Theoretic Framework for Multi-Agent LLM Systems
Furkan Mumcu, Yasin Yilmaz

TL;DR
This paper introduces Socially-Weighted Alignment (SWA), a game-theoretic framework for multi-agent LLM systems that balances individual objectives and group welfare, leading to stable shared-resource management.
Contribution
It proposes SWA as a novel inference-time method to improve multi-agent LLM coordination without retraining or reinforcement learning.
Findings
SWA induces a phase transition at a critical social weight threshold.
Above the threshold, agents stabilize and avoid congestion.
Empirical validation confirms theoretical predictions.
Abstract
Deploying large language model (LLM) agents in shared environments introduces a fundamental tension between individual alignment and collective stability: locally rational decisions can impose negative externalities that degrade system-level performance. We propose Socially-Weighted Alignment (SWA), a game-theoretic framework that modifies inference-time decision making by interpolating between an agent's private objective and an estimate of group welfare via a social weight . In a shared-resource congestion game with agents and congestion severity , we show that SWA induces a critical threshold above which agents no longer have marginal incentive to increase demand under overload, yielding a phase transition from persistent congestion to stable operation near capacity. We further provide an inference-time algorithmic instantiation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAge of Information Optimization · Ferroelectric and Negative Capacitance Devices · Machine Learning and Algorithms
