On the Approximation of Cooperative Heterogeneous Multi-Agent   Reinforcement Learning (MARL) using Mean Field Control (MFC)

Washim Uddin Mondal; Mridul Agarwal; Vaneet Aggarwal; and Satish V.; Ukkusuri

arXiv:2109.04024·cs.LG·May 10, 2022·6 cites

On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC)

Washim Uddin Mondal, Mridul Agarwal, Vaneet Aggarwal, and Satish V., Ukkusuri

PDF

Open Access 1 Video

TL;DR

This paper establishes approximation guarantees for heterogeneous multi-agent reinforcement learning problems using mean field control, providing error bounds and a convergent policy gradient algorithm.

Contribution

It introduces theoretical bounds for approximating heterogeneous MARL with MFC and proposes a Natural Policy Gradient algorithm with convergence guarantees.

Findings

01

Derived error bounds for three distribution scenarios.

02

Proposed a NPG algorithm with convergence guarantees.

03

Quantified sample complexity for policy convergence.

Abstract

Mean field control (MFC) is an effective way to mitigate the curse of dimensionality of cooperative multi-agent reinforcement learning (MARL) problems. This work considers a collection of $N_{pop}$ heterogeneous agents that can be segregated into $K$ classes such that the $k$ -th class contains $N_{k}$ homogeneous agents. We aim to prove approximation guarantees of the MARL problem for this heterogeneous system by its corresponding MFC problem. We consider three scenarios where the reward and transition dynamics of all agents are respectively taken to be functions of $(1)$ joint state and action distributions across all classes, $(2)$ individual distributions of each class, and $(3)$ marginal distributions of the entire population. We show that, in these cases, the $K$ -class MARL problem can be approximated by MFC with errors given as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC)· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Game Theory and Applications