TL;DR
This paper explores the use of Multi-Agent Reinforcement Learning for decentralized UAV control in critical data relay tasks, highlighting scalability challenges and providing a benchmark environment.
Contribution
Introduces a family of deterministic games for MARL scalability studies and proposes a baseline policy for UAV data relay tasks.
Findings
Off-the-shelf MARL algorithms perform well with few agents
Scalability issues emerge as agent count increases
Source code and animations are publicly available
Abstract
This work studies the application of Multi-Agent Reinforcement Learning (MARL) to decentralized control of unmanned aerial vehicles to relay a critical data package to a known position. For this purpose, a family of deterministic games is introduced, designed for MARL scaling studies. A robust baseline policy is proposed which restricts agent motion and applies Dijkstra's shortest path algorithm. Computational experiment results show that two off-the-shelf MARL algorithms perform competitively with the baseline for a small number of agents, but face scalability issues as the number of agents increases. Source code and animations are available online at https://github.com/mikapersson/Information-Relaying.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
