Learning Decentralized Partially Observable Mean Field Control for   Artificial Collective Behavior

Kai Cui; Sascha Hauck; Christian Fabian; Heinz Koeppl

arXiv:2307.06175·cs.LG·February 26, 2024·1 cites

Learning Decentralized Partially Observable Mean Field Control for Artificial Collective Behavior

Kai Cui, Sascha Hauck, Christian Fabian, Heinz Koeppl

PDF

Open Access

TL;DR

This paper introduces a decentralized partially observable mean field control framework for multi-agent reinforcement learning, enabling scalable and effective collective behavior modeling with theoretical guarantees and practical algorithms.

Contribution

It proposes a novel Dec-POMFC model that handles decentralization and partial observability, with theoretical analysis and policy gradient algorithms for multi-agent RL.

Findings

01

Dec-POMFC reduces multi-agent problems to single-agent MDPs.

02

Algorithms achieve performance comparable to state-of-the-art MARL.

03

Kernel methods improve mean field control accuracy.

Abstract

Recent reinforcement learning (RL) methods have achieved success in various domains. However, multi-agent RL (MARL) remains a challenge in terms of decentralization, partial observability and scalability to many agents. Meanwhile, collective behavior requires resolution of the aforementioned challenges, and remains of importance to many state-of-the-art applications such as active matter physics, self-organizing systems, opinion dynamics, and biological or robotic swarms. Here, MARL via mean field control (MFC) offers a potential solution to scalability, but fails to consider decentralized and partially observable systems. In this paper, we enable decentralized behavior of agents under partial information by proposing novel models for decentralized partially observable MFC (Dec-POMFC), a broad class of problems with permutation-invariant agents allowing for reduction to tractable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Age of Information Optimization · Distributed Control Multi-Agent Systems