Near-Optimal Multi-Agent Learning for Safe Coverage Control
Manish Prajapat, Matteo Turchetta, Melanie N. Zeilinger, Andreas, Krause

TL;DR
This paper introduces algorithms for multi-agent coverage control that learn unknown density functions efficiently while ensuring safety, achieving near-optimal coverage with provable safety guarantees.
Contribution
It proposes MacOpt and SafeMac algorithms that address exploration-exploitation trade-offs and safety in multi-agent coverage, with theoretical guarantees and practical evaluations.
Findings
SafeMac guarantees safety during exploration.
Algorithms achieve near-optimal coverage in finite time.
Outperforms existing methods in synthetic and real tasks.
Abstract
In multi-agent coverage control problems, agents navigate their environment to reach locations that maximize the coverage of some density. In practice, the density is rarely known , further complicating the original NP-hard problem. Moreover, in many applications, agents cannot visit arbitrary locations due to unknown safety constraints. In this paper, we aim to efficiently learn the density to approximately solve the coverage problem while preserving the agents' safety. We first propose a conditionally linear submodular coverage function that facilitates theoretical analysis. Utilizing this structure, we develop MacOpt, a novel algorithm that efficiently trades off the exploration-exploitation dilemma due to partial observability, and show that it achieves sublinear regret. Next, we extend results on single-agent safe exploration to our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Reinforcement Learning in Robotics · Auction Theory and Applications
