Mean-Field Control based Approximation of Multi-Agent Reinforcement Learning in Presence of a Non-decomposable Shared Global State
Washim Uddin Mondal, Vaneet Aggarwal, Satish V. Ukkusuri

TL;DR
This paper demonstrates that Mean Field Control can effectively approximate large-scale Multi-Agent Reinforcement Learning even with a shared global state that is non-decomposable, providing error bounds and a practical algorithm.
Contribution
It extends MFC applicability to settings with shared global states and derives error bounds independent of the global state size, along with a Natural Policy Gradient algorithm.
Findings
Approximation error is $igO(e)$, independent of global state size.
Error reduces to $rac{ ext{sqrt}| ext{X}|}{ ext{sqrt}N}$ under certain conditions.
Proposed algorithm achieves $igO( ext{epsilon}^{-3})$ sample complexity with near-optimal policies.
Abstract
Mean Field Control (MFC) is a powerful approximation tool to solve large-scale Multi-Agent Reinforcement Learning (MARL) problems. However, the success of MFC relies on the presumption that given the local states and actions of all the agents, the next (local) states of the agents evolve conditionally independent of each other. Here we demonstrate that even in a MARL setting where agents share a common global state in addition to their local states evolving conditionally independently (thus introducing a correlation between the state transition processes of individual agents), the MFC can still be applied as a good approximation tool. The global state is assumed to be non-decomposable i.e., it cannot be expressed as a collection of local states of the agents. We compute the approximation error as where $e=\frac{1}{\sqrt{N}}\left[\sqrt{|\mathcal{X}|}…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
