MARVEL: Multi-Agent Reinforcement Learning for constrained field-of-View multi-robot Exploration in Large-scale environments
Jimmy Chiun, Shizhe Zhang, Yizhuo Wang, Yuhong Cao, and Guillaume, Sartoretti

TL;DR
MARVEL is a neural framework using multi-agent reinforcement learning to enable teams of robots with limited field-of-view sensors to collaboratively explore large-scale environments efficiently.
Contribution
It introduces a decentralized policy with graph attention networks and novel feature fusion, handling large action spaces and sensor constraints without retraining.
Findings
MARVEL outperforms existing exploration planners in various metrics.
It generalizes well to different team sizes and sensor configurations.
Successfully deployed on real drone hardware in large environments.
Abstract
In multi-robot exploration, a team of mobile robot is tasked with efficiently mapping an unknown environments. While most exploration planners assume omnidirectional sensors like LiDAR, this is impractical for small robots such as drones, where lightweight, directional sensors like cameras may be the only option due to payload constraints. These sensors have a constrained field-of-view (FoV), which adds complexity to the exploration problem, requiring not only optimal robot positioning but also sensor orientation during movement. In this work, we propose MARVEL, a neural framework that leverages graph attention networks, together with novel frontiers and orientation features fusion technique, to develop a collaborative, decentralized policy using multi-agent reinforcement learning (MARL) for robots with constrained FoV. To handle the large action space of viewpoints planning, we further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robotics and Sensor-Based Localization · Robotic Path Planning Algorithms
