Minimizing Worst-Case Weighted Latency for Multi-Robot Persistent Monitoring: Theory and RL-Based Solutions
Weizhen Wang, Ziheng Wang, Jianping He, Xinping Guan, Xiaoming Duan

TL;DR
This paper develops theoretical and reinforcement learning solutions for multi-robot persistent monitoring on weighted graphs, focusing on minimizing worst-case weighted latency and introducing a new benchmark platform.
Contribution
It introduces tail-performance objectives, establishes their properties, constructs an event-driven MDP, and develops RL methods along with a new benchmark for evaluation.
Findings
RL methods effectively reduce worst-case weighted latency.
The proposed approach outperforms baseline algorithms.
The benchmark supports comprehensive evaluation of monitoring strategies.
Abstract
We study multi-robot persistent monitoring on weighted graphs, where node weights encode monitoring priorities and edge weights encode travel distances. The goal is to design joint robot trajectories that minimize the worst-case weighted latency across all nodes over an infinite time horizon. The widely adopted worst-case latency objective evaluates team performance over the entire time horizon and therefore may fail to distinguish strategies with poor transient behavior but strong asymptotic performance. To address this limitation, we propose a family of tail-performance objectives that generalize the standard objective and study the resulting functional optimization problems. We establish several key theoretical properties, including the existence of optimal strategies, relationships among the proposed objectives and their corresponding optimization problems, approximation by periodic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
