Dual-Mandate Patrols: Multi-Armed Bandits for Green Security

Lily Xu; Elizabeth Bondi; Fei Fang; Andrew Perrault; Kai Wang; Milind; Tambe

arXiv:2009.06560·cs.LG·April 29, 2024

Dual-Mandate Patrols: Multi-Armed Bandits for Green Security

Lily Xu, Elizabeth Bondi, Fei Fang, Andrew Perrault, Kai Wang, Milind, Tambe

PDF

Open Access 2 Repos

TL;DR

This paper introduces LIZARD, a multi-armed bandit algorithm tailored for green security patrols, balancing exploration and exploitation to optimize patrol strategies and improve poaching prevention.

Contribution

It develops a no-regret bandit approach that combines Lipschitz continuity and action decomposition, enhancing both short-term and long-term patrol effectiveness.

Findings

01

LIZARD outperforms existing methods on real-world poaching data.

02

The approach guarantees convergence and improves short-term patrol performance.

03

Bridges combinatorial and Lipschitz bandit techniques for better security strategies.

Abstract

Conservation efforts in green security domains to protect wildlife and forests are constrained by the limited availability of defenders (i.e., patrollers), who must patrol vast areas to protect from attackers (e.g., poachers or illegal loggers). Defenders must choose how much time to spend in each region of the protected area, balancing exploration of infrequently visited regions and exploitation of known hotspots. We formulate the problem as a stochastic multi-armed bandit, where each action represents a patrol strategy, enabling us to guarantee the rate of convergence of the patrolling policy. However, a naive bandit approach would compromise short-term performance for long-term optimality, resulting in animals poached and forests destroyed. To speed up performance, we leverage smoothness in the reward function and decomposability of actions. We show a synergy between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Reinforcement Learning in Robotics