Multi-Agent Deep Reinforcement Learning For Persistent Monitoring With Sensing, Communication, and Localization Constraints
Manav Mishra, Prithvi Poddar, Rajat Agarwal, Jingxi Chen, Pratap, Tokekar, P.B. Sujit

TL;DR
This paper introduces GALOPP, a graph convolution-based deep reinforcement learning method for multi-robot persistent monitoring under sensing, communication, and localization constraints, demonstrating effective policies in simulations and hardware tests.
Contribution
It presents GALOPP, a novel MARL architecture that incorporates localization and communication constraints for persistent monitoring tasks.
Findings
GALOPP effectively learns monitoring policies in complex environments.
Performance is influenced by communication range, obstacle density, and sensing range.
GALOPP outperforms non-RL baselines like greedy and random search methods.
Abstract
Determining multi-robot motion policies for persistently monitoring a region with limited sensing, communication, and localization constraints in non-GPS environments is a challenging problem. To take the localization constraints into account, in this paper, we consider a heterogeneous robotic system consisting of two types of agents: anchor agents with accurate localization capability and auxiliary agents with low localization accuracy. To localize itself, the auxiliary agents must be within the communication range of an {anchor}, directly or indirectly. The robotic team's objective is to minimize environmental uncertainty through persistent monitoring. We propose a multi-agent deep reinforcement learning (MARL) based architecture with graph convolution called Graph Localized Proximal Policy Optimization (GALOPP), which incorporates the limited sensor field-of-view, communication, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics · Distributed Control Multi-Agent Systems
