Cooperative Patrol Routing: Optimizing Urban Crime Surveillance through Multi-Agent Reinforcement Learning
Juan Palma-Borda, Eduardo Guzm\'an, Mar\'ia-Victoria Belmonte

TL;DR
This paper presents a multi-agent reinforcement learning approach to optimize urban police patrol routes, maximizing crime hotspot coverage with coordinated strategies tested in Malaga, achieving over 90% coverage of high-crime areas.
Contribution
It introduces a decentralized MARL model with a novel coverage index metric for urban patrol route optimization, demonstrating superior performance in crime hotspot coverage.
Findings
Achieved over 90% coverage of top 3% crime hotspots.
Compared multiple MARL algorithms, with VDPPO performing best.
Evaluated the impact of patrol number, starting positions, and information level.
Abstract
The effective design of patrol strategies is a difficult and complex problem, especially in medium and large areas. The objective is to plan, in a coordinated manner, the optimal routes for a set of patrols in a given area, in order to achieve maximum coverage of the area, while also trying to minimize the number of patrols. In this paper, we propose a multi-agent reinforcement learning (MARL) model, based on a decentralized partially observable Markov decision process, to plan unpredictable patrol routes within an urban environment represented as an undirected graph. The model attempts to maximize a target function that characterizes the environment within a given time frame. Our model has been tested to optimize police patrol routes in three medium-sized districts of the city of Malaga. The aim was to maximize surveillance coverage of the most crime-prone areas, based on actual crime…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Evacuation and Crowd Dynamics · Vehicular Ad Hoc Networks (VANETs)
MethodsSparse Evolutionary Training
