Partially Observable Multi-Agent Reinforcement Learning with Information Sharing
Xiangyu Liu, Kaiqing Zhang

TL;DR
This paper proposes a computationally feasible approach for multi-agent reinforcement learning in partially observable environments by leveraging information sharing and approximations, enabling equilibrium and team-optimal solutions.
Contribution
It introduces a quasi-polynomial time algorithm for equilibrium and team-optimal solutions in partially observable multi-agent RL using information sharing and model approximation.
Findings
Quasi-polynomial time algorithms for equilibrium in POSGs.
Sample complexity bounds for the proposed RL algorithm.
Extension to team-optimal solutions in cooperative settings.
Abstract
We study provable multi-agent reinforcement learning (RL) in the general framework of partially observable stochastic games (POSGs). To circumvent the known hardness results and the use of computationally intractable oracles, we advocate leveraging the potential \emph{information-sharing} among agents, a common practice in empirical multi-agent RL, and a standard model for multi-agent control systems with communication. We first establish several computational complexity results to justify the necessity of information-sharing, as well as the observability assumption that has enabled quasi-polynomial time and sample single-agent RL with partial observations, for tractably solving POSGs. Inspired by the inefficiency of planning in the ground-truth model, we then propose to further \emph{approximate} the shared common information to construct an approximate model of the POSG, in which an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuction Theory and Applications · Game Theory and Applications · Reinforcement Learning in Robotics
