MFC-EQ: Mean-Field Control with Envelope Q-Learning for Moving Decentralized Agents in Formation
Qiushi Lin, Hang Ma

TL;DR
This paper introduces MFC-EQ, a scalable decentralized learning framework for multi-agent formation control that balances collision avoidance and formation maintenance, outperforming existing methods in complex, dynamic scenarios.
Contribution
The paper presents MFC-EQ, a novel mean-field control with envelop Q-learning approach enabling scalable, adaptable, and preference-agnostic multi-agent formation planning under partial observation.
Findings
MFC-EQ outperforms state-of-the-art centralized baselines.
It effectively manages dynamic formation changes.
The method scales well with increasing number of agents.
Abstract
We study a decentralized version of Moving Agents in Formation (MAiF), a variant of Multi-Agent Path Finding aiming to plan collision-free paths for multiple agents with the dual objectives of reaching their goals quickly while maintaining a desired formation. The agents must balance these objectives under conditions of partial observation and limited communication. The formation maintenance depends on the joint state of all agents, whose dimensionality increases exponentially with the number of agents, rendering the learning process intractable. Additionally, learning a single policy that can accommodate different linear preferences for these two objectives presents a significant challenge. In this paper, we propose Mean-Field Control with Envelop -learning (MFC-EQ), a scalable and adaptable learning framework for this bi-objective multi-agent problem. We approximate the dynamics of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Neural Networks and Applications · Distributed Control Multi-Agent Systems
