Mean-Field Control based Approximation of Multi-Agent Reinforcement   Learning in Presence of a Non-decomposable Shared Global State

Washim Uddin Mondal; Vaneet Aggarwal; Satish V. Ukkusuri

arXiv:2301.06889·cs.LG·May 30, 2023·1 cites

Mean-Field Control based Approximation of Multi-Agent Reinforcement Learning in Presence of a Non-decomposable Shared Global State

Washim Uddin Mondal, Vaneet Aggarwal, Satish V. Ukkusuri

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that Mean Field Control can effectively approximate large-scale Multi-Agent Reinforcement Learning even with a shared global state that is non-decomposable, providing error bounds and a practical algorithm.

Contribution

It extends MFC applicability to settings with shared global states and derives error bounds independent of the global state size, along with a Natural Policy Gradient algorithm.

Findings

01

Approximation error is $igO(e)$, independent of global state size.

02

Error reduces to $rac{ ext{sqrt}| ext{X}|}{ ext{sqrt}N}$ under certain conditions.

03

Proposed algorithm achieves $igO( ext{epsilon}^{-3})$ sample complexity with near-optimal policies.

Abstract

Mean Field Control (MFC) is a powerful approximation tool to solve large-scale Multi-Agent Reinforcement Learning (MARL) problems. However, the success of MFC relies on the presumption that given the local states and actions of all the agents, the next (local) states of the agents evolve conditionally independent of each other. Here we demonstrate that even in a MARL setting where agents share a common global state in addition to their local states evolving conditionally independently (thus introducing a correlation between the state transition processes of individual agents), the MFC can still be applied as a good approximation tool. The global state is assumed to be non-decomposable i.e., it cannot be expressed as a collection of local states of the agents. We compute the approximation error as $O (e)$ where $e=\frac{1}{\sqrt{N}}\left[\sqrt{|\mathcal{X}|}…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://github.itap.purdue.edu/Clan-labs/MeanFieldwithGlobalState
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics