A Local Information Aggregation based Multi-Agent Reinforcement Learning for Robot Swarm Dynamic Task Allocation
Yang Lv, Jinlong Lei, Peng Yi

TL;DR
This paper presents a novel decentralized multi-agent reinforcement learning framework with local information aggregation, significantly improving dynamic task allocation efficiency and adaptability in robot swarms operating in changing environments.
Contribution
The introduction of the LIA_MADDPG algorithm that combines local information aggregation with centralized training and distributed execution for improved multi-robot task allocation.
Findings
LIA module enhances performance of MARL methods.
LIA_MADDPG outperforms six conventional algorithms.
Demonstrates superior scalability and adaptability.
Abstract
In this paper, we explore how to optimize task allocation for robot swarms in dynamic environments, emphasizing the necessity of formulating robust, flexible, and scalable strategies for robot cooperation. We introduce a novel framework using a decentralized partially observable Markov decision process (Dec_POMDP), specifically designed for distributed robot swarm networks. At the core of our methodology is the Local Information Aggregation Multi-Agent Deep Deterministic Policy Gradient (LIA_MADDPG) algorithm, which merges centralized training with distributed execution (CTDE). During the centralized training phase, a local information aggregation (LIA) module is meticulously designed to gather critical data from neighboring robots, enhancing decision-making efficiency. In the distributed execution phase, a strategy improvement method is proposed to dynamically adjust task allocation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Distributed Control Multi-Agent Systems
