Learning Decentralized Partially Observable Mean Field Control for Artificial Collective Behavior
Kai Cui, Sascha Hauck, Christian Fabian, Heinz Koeppl

TL;DR
This paper introduces a decentralized partially observable mean field control framework for multi-agent reinforcement learning, enabling scalable and effective collective behavior modeling with theoretical guarantees and practical algorithms.
Contribution
It proposes a novel Dec-POMFC model that handles decentralization and partial observability, with theoretical analysis and policy gradient algorithms for multi-agent RL.
Findings
Dec-POMFC reduces multi-agent problems to single-agent MDPs.
Algorithms achieve performance comparable to state-of-the-art MARL.
Kernel methods improve mean field control accuracy.
Abstract
Recent reinforcement learning (RL) methods have achieved success in various domains. However, multi-agent RL (MARL) remains a challenge in terms of decentralization, partial observability and scalability to many agents. Meanwhile, collective behavior requires resolution of the aforementioned challenges, and remains of importance to many state-of-the-art applications such as active matter physics, self-organizing systems, opinion dynamics, and biological or robotic swarms. Here, MARL via mean field control (MFC) offers a potential solution to scalability, but fails to consider decentralized and partially observable systems. In this paper, we enable decentralized behavior of agents under partial information by proposing novel models for decentralized partially observable MFC (Dec-POMFC), a broad class of problems with permutation-invariant agents allowing for reduction to tractable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Age of Information Optimization · Distributed Control Multi-Agent Systems
