Privileged Reinforcement and Communication Learning for Distributed, Bandwidth-limited Multi-robot Exploration
Yixiao Ma, Jingsong Liang, Yuhong Cao, Derek Ming Siang Tan, Guillaume, Sartoretti

TL;DR
This paper introduces a deep reinforcement learning framework that enables multi-robot teams to explore environments efficiently while drastically reducing communication bandwidth by embedding salient information into fixed-sized messages.
Contribution
The work presents a novel privileged reinforcement learning approach with attention mechanisms that significantly cuts communication needs with minimal impact on exploration performance.
Findings
Communication reduced by up to 100 times
Exploration efficiency drops only by 2.4% in travel distance
Effective guidance of policy learning through privileged ground truth knowledge
Abstract
Communication bandwidth is an important consideration in multi-robot exploration, where information exchange among robots is critical. While existing methods typically aim to reduce communication throughput, they either require significant computation or significantly compromise exploration efficiency. In this work, we propose a deep reinforcement learning framework based on communication and privileged reinforcement learning to achieve a significant reduction in bandwidth consumption, while minimally sacrificing exploration efficiency. Specifically, our approach allows robots to learn to embed the most salient information from their individual belief (partial map) over the environment into fixed-sized messages. Robots then reason about their own belief as well as received messages to distributedly explore the environment while avoiding redundant work. In doing so, we employ privileged…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModular Robots and Swarm Intelligence · Distributed Control Multi-Agent Systems · Reinforcement Learning in Robotics
