Privileged Reinforcement and Communication Learning for Distributed, Bandwidth-limited Multi-robot Exploration

Yixiao Ma; Jingsong Liang; Yuhong Cao; Derek Ming Siang Tan; Guillaume Sartoretti

arXiv:2407.20203·cs.RO·July 30, 2024

Open Access 1 Repo

TL;DR

This paper introduces a deep reinforcement learning framework that enables multi-robot teams to explore environments efficiently while drastically reducing communication bandwidth by embedding salient information into fixed-sized messages.

Contribution

The work presents a novel privileged reinforcement learning approach with attention mechanisms that significantly cuts communication needs with minimal impact on exploration performance.

Findings

01 Communication reduced by up to 100 times

02 Exploration efficiency drops by only 2.4%, measured in travel distance

03 Privileged ground-truth knowledge effectively guides policy learning

Abstract

Communication bandwidth is an important consideration in multi-robot exploration, where information exchange among robots is critical. While existing methods typically aim to reduce communication throughput, they either require significant computation or significantly compromise exploration efficiency. In this work, we propose a deep reinforcement learning framework based on communication and privileged reinforcement learning to achieve a significant reduction in bandwidth consumption, while minimally sacrificing exploration efficiency. Specifically, our approach allows robots to learn to embed the most salient information from their individual belief (partial map) over the environment into fixed-sized messages. Robots then reason about their own belief as well as received messages to distributedly explore the environment while avoiding redundant work. In doing so, we employ privileged…
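The abstract describes robots embedding the most salient parts of a partial map into fixed-sized messages. As a rough illustration of how an attention mechanism can produce a message whose size does not depend on the size of the belief, the sketch below pools a variable number of feature vectors through a fixed set of query vectors. It is not the authors' architecture: `attention_pool`, `FEATURE_DIM`, `NUM_SLOTS`, and the randomly drawn `queries` (standing in for parameters a real system would learn) are all hypothetical names and values chosen for this example.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(node_features, queries):
    """Compress any number of feature vectors into len(queries) vectors.

    Each query scores every node (scaled dot product), the scores are
    normalised with softmax, and the nodes are averaged with those weights.
    The output size depends only on the number of queries.
    """
    dim = len(queries[0])
    scale = math.sqrt(dim)
    message = []
    for q in queries:
        scores = [sum(qi * xi for qi, xi in zip(q, x)) / scale
                  for x in node_features]
        weights = softmax(scores)
        pooled = [sum(w * x[d] for w, x in zip(weights, node_features))
                  for d in range(dim)]
        message.append(pooled)
    return message


def make_random_vectors(count, dim, rng):
    """Helper for the demo: random vectors in [-1, 1]."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(count)]


rng = random.Random(0)
FEATURE_DIM = 8  # hypothetical per-node feature size
NUM_SLOTS = 4    # hypothetical number of message slots

# Stand-in for learned query vectors (assumption: a trained model would learn these).
queries = make_random_vectors(NUM_SLOTS, FEATURE_DIM, rng)

# Two beliefs of very different size, e.g. early and late in exploration.
small_belief = make_random_vectors(10, FEATURE_DIM, rng)
large_belief = make_random_vectors(500, FEATURE_DIM, rng)

msg_small = attention_pool(small_belief, queries)
msg_large = attention_pool(large_belief, queries)

print(len(msg_small), len(msg_small[0]))
print(len(msg_large), len(msg_large[0]))
```

Both messages have the same shape even though one belief holds 50 times more nodes than the other, which is the property that keeps per-message bandwidth constant as the explored map grows.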

Code & Models

Repositories

marmotlab/bandwidth-limited-multi-robot-exploration
PyTorch · Official

Taxonomy

Topics: Modular Robots and Swarm Intelligence · Distributed Control Multi-Agent Systems · Reinforcement Learning in Robotics