Multi-Agent Deep Reinforcement Learning For Persistent Monitoring With   Sensing, Communication, and Localization Constraints

Manav Mishra; Prithvi Poddar; Rajat Agarwal; Jingxi Chen; Pratap; Tokekar; P.B. Sujit

arXiv:2109.06831·cs.RO·May 16, 2023

Multi-Agent Deep Reinforcement Learning For Persistent Monitoring With Sensing, Communication, and Localization Constraints

Manav Mishra, Prithvi Poddar, Rajat Agarwal, Jingxi Chen, Pratap, Tokekar, P.B. Sujit

PDF

Open Access

TL;DR

This paper introduces GALOPP, a graph convolution-based deep reinforcement learning method for multi-robot persistent monitoring under sensing, communication, and localization constraints, demonstrating effective policies in simulations and hardware tests.

Contribution

It presents GALOPP, a novel MARL architecture that incorporates localization and communication constraints for persistent monitoring tasks.

Findings

01

GALOPP effectively learns monitoring policies in complex environments.

02

Performance is influenced by communication range, obstacle density, and sensing range.

03

GALOPP outperforms non-RL baselines like greedy and random search methods.

Abstract

Determining multi-robot motion policies for persistently monitoring a region with limited sensing, communication, and localization constraints in non-GPS environments is a challenging problem. To take the localization constraints into account, in this paper, we consider a heterogeneous robotic system consisting of two types of agents: anchor agents with accurate localization capability and auxiliary agents with low localization accuracy. To localize itself, the auxiliary agents must be within the communication range of an {anchor}, directly or indirectly. The robotic team's objective is to minimize environmental uncertainty through persistent monitoring. We propose a multi-agent deep reinforcement learning (MARL) based architecture with graph convolution called Graph Localized Proximal Policy Optimization (GALOPP), which incorporates the limited sensor field-of-view, communication, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics · Distributed Control Multi-Agent Systems