COMPASS: Cooperative Multi-Agent Persistent Monitoring using Spatio-Temporal Attention Network
Xingjian Zhang, Yizhuo Wang, Guillaume Sartoretti

TL;DR
COMPASS is a multi-agent reinforcement learning framework that uses spatio-temporal attention and Gaussian Processes to enable decentralized agents to efficiently and persistently monitor multiple moving targets in dynamic environments.
Contribution
We introduce COMPASS, a novel MARL framework with a spatio-temporal attention network and Gaussian Process modeling for persistent multi-target monitoring.
Findings
Outperforms baselines in uncertainty reduction
Achieves higher target coverage
Demonstrates improved coordination efficiency
Abstract
Persistent monitoring of dynamic targets is essential in real-world applications such as disaster response, environmental sensing, and wildlife conservation, where mobile agents must continuously gather information under uncertainty. We propose COMPASS, a multi-agent reinforcement learning (MARL) framework that enables decentralized agents to persistently monitor multiple moving targets efficiently. We model the environment as a graph, where nodes represent spatial locations and edges capture topological proximity, allowing agents to reason over structured layouts and revisit informative regions as needed. Each agent independently selects actions based on a shared spatio-temporal attention network that we design to integrate historical observations and spatial context. We model target dynamics using Gaussian Processes (GPs), which support principled belief updates and enable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Target Tracking and Data Fusion in Sensor Networks · Video Surveillance and Tracking Methods
