Multi-Agent Reinforcement Learning for Visibility-based Persistent Monitoring
Jingxi Chen, Amrish Baskaran, Zhongshun Zhang, Pratap Tokekar

TL;DR
This paper introduces a multi-agent reinforcement learning approach using graph attention and PPO to optimize persistent monitoring by robots with limited views, demonstrating improved policies over baselines.
Contribution
It proposes a novel MA-G-PPO algorithm that effectively coordinates multiple robots for visibility-based monitoring using local and global information.
Findings
MA-G-PPO outperforms non-RL baseline in most scenarios
Sharing information among agents enhances policy effectiveness
Emergent behaviors are observed in the learned policies
Abstract
The Visibility-based Persistent Monitoring (VPM) problem seeks to find a set of trajectories (or controllers) for robots to persistently monitor a changing environment. Each robot has a sensor, such as a camera, with a limited field-of-view that is obstructed by obstacles in the environment. The robots may need to coordinate with each other to ensure no point in the environment is left unmonitored for long periods of time. We model the problem such that there is a penalty that accrues every time step if a point is left unmonitored. However, the dynamics of the penalty are unknown to us. We present a Multi-Agent Reinforcement Learning (MARL) algorithm for the VPM problem. Specifically, we present a Multi-Agent Graph Attention Proximal Policy Optimization (MA-G-PPO) algorithm that takes as input the local observations of all agents combined with a low resolution global map to learn a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · Optimization and Search Problems · Reinforcement Learning in Robotics
