Surveillance Evasion Through Bayesian Reinforcement Learning

Dongping Qi; David Bindel; Alexander Vladimirsky

arXiv:2109.14811 · cs.LG · February 24, 2023

TL;DR

This paper presents a Bayesian reinforcement learning approach for path planning that enables an evader to minimize detection risk in a surveillance environment by learning and adapting to spatial surveillance patterns over multiple episodes.

Contribution

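The paper combines Gaussian Process regression (to model the unknown surveillance intensity), numerical methods for Hamilton-Jacobi PDEs (to plan trajectories under the current model), and confidence bounds (to balance exploration and exploitation) for continuous surveillance-evading path planning. The authors report significant advantages over the alternatives they compare against in numerical experiments.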
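To make the modeling and confidence-bound ingredients concrete, here is a minimal sketch in Python with NumPy. It is not the authors' implementation. It assumes that noisy point observations of the intensity are available directly, which simplifies how information is gathered across episodes, and it assumes a squared-exponential kernel. All names (`rbf_kernel`, `gp_posterior`, `lower_confidence_bound`), the parameters `length_scale`, `noise`, `beta`, and `floor`, and the toy data are hypothetical.

```python
import numpy as np

def rbf_kernel(A, B, length_scale=0.3, variance=1.0):
    """Squared-exponential covariance between point sets A (n, 2) and B (m, 2)."""
    sq_dist = ((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=-1)
    return variance * np.exp(-0.5 * sq_dist / length_scale**2)

def gp_posterior(X_obs, y_obs, X_query, noise=1e-2):
    """Posterior mean and standard deviation of a zero-mean GP at X_query."""
    K = rbf_kernel(X_obs, X_obs) + noise * np.eye(len(X_obs))
    K_s = rbf_kernel(X_obs, X_query)
    L = np.linalg.cholesky(K)                      # K = L @ L.T
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_obs))
    mean = K_s.T @ alpha
    v = np.linalg.solve(L, K_s)
    var = rbf_kernel(X_query, X_query).diagonal() - (v**2).sum(axis=0)
    return mean, np.sqrt(np.maximum(var, 0.0))

def lower_confidence_bound(mean, std, beta=2.0, floor=1e-3):
    """Optimistic intensity estimate: mean minus beta standard deviations,
    clipped below at a small positive floor (an assumed safeguard)."""
    return np.maximum(mean - beta * std, floor)

# Hypothetical toy data: three intensity observations in the unit square.
X_obs = np.array([[0.2, 0.2], [0.5, 0.5], [0.8, 0.3]])
y_obs = np.array([1.0, 3.0, 0.5])

# Query one observed location and one location far from all observations.
X_query = np.array([[0.5, 0.5], [0.0, 1.0]])
mean, std = gp_posterior(X_obs, y_obs, X_query)
lcb = lower_confidence_bound(mean, std)
```

Planning against the lower bound rather than the posterior mean makes poorly observed regions look cheaper than they may be, which is one common way to encourage exploration when the objective is a cost to be minimized. Whether this matches the paper's exact confidence-bound rule is an assumption here.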

Findings

01. Significant reduction in detection probability compared to baseline algorithms

02. Effective learning of surveillance intensity through Bayesian methods

03. Improved regret metrics demonstrating better performance over episodes

Abstract

We consider a task of surveillance-evading path-planning in a continuous setting. An Evader strives to escape from a 2D domain while minimizing the risk of detection (and immediate capture). The probability of detection is path-dependent and determined by the spatially inhomogeneous surveillance intensity, which is fixed but a priori unknown and gradually learned in the multi-episodic setting. We introduce a Bayesian reinforcement learning algorithm that relies on a Gaussian Process regression (to model the surveillance intensity function based on the information from prior episodes), numerical methods for Hamilton-Jacobi PDEs (to plan the best continuous trajectories based on the current model), and Confidence Bounds (to balance the exploration vs exploitation). We use numerical experiments and regret metrics to highlight the significant advantages of our approach compared to…
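As a rough illustration of the planning step, the sketch below finds a minimum-exposure escape path on a grid. It is a sketch under stated assumptions, not the paper's method: Dijkstra's algorithm on an 8-connected grid stands in for a numerical Hamilton-Jacobi solver, and the detection probability along a path is assumed to be 1 - exp(-cumulative intensity), a Poisson-type model that the abstract does not spell out. The function `min_exposure_exit`, the grid spacing `h`, and the toy intensity map are hypothetical.

```python
import heapq
import math
import numpy as np

def min_exposure_exit(K, start, h=1.0):
    """Dijkstra on an 8-connected grid with nonnegative intensity K.

    Returns the least cumulative intensity from `start` to any boundary
    cell, together with the corresponding path as a list of (row, col).
    """
    n_rows, n_cols = K.shape
    dist = np.full(K.shape, np.inf)
    dist[start] = 0.0
    parent = {}
    heap = [(0.0, start)]
    moves = [(di, dj) for di in (-1, 0, 1) for dj in (-1, 0, 1) if (di, dj) != (0, 0)]
    while heap:
        d, (i, j) = heapq.heappop(heap)
        if d > dist[i, j]:
            continue  # stale heap entry
        if i in (0, n_rows - 1) or j in (0, n_cols - 1):
            # First boundary cell popped is the cheapest exit; rebuild the path.
            path = [(i, j)]
            while path[-1] != start:
                path.append(parent[path[-1]])
            return d, path[::-1]
        for di, dj in moves:
            ni, nj = i + di, j + dj
            if 0 <= ni < n_rows and 0 <= nj < n_cols:
                # Trapezoidal estimate of the intensity integrated along the edge.
                step = h * math.hypot(di, dj)
                nd = d + 0.5 * (K[i, j] + K[ni, nj]) * step
                if nd < dist[ni, nj]:
                    dist[ni, nj] = nd
                    parent[(ni, nj)] = (i, j)
                    heapq.heappush(heap, (nd, (ni, nj)))
    raise ValueError("no boundary cell reachable from start")

# Hypothetical intensity map: the right-hand columns are heavily surveilled.
K = np.ones((5, 5))
K[:, 3:] = 5.0
cost, path = min_exposure_exit(K, start=(2, 2), h=0.25)

# Assumed Poisson-type detection model: P(detected) = 1 - exp(-cost).
p_detect = 1.0 - math.exp(-cost)
```

A grid search like this restricts motion to eight directions, so its paths are only approximately optimal in the continuous setting; that limitation is one reason a PDE-based solver is the natural tool for the problem described in the abstract.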

Code & Models

Repositories

eikonal-equation/Bayesian-Surveillance-Evasion

Taxonomy

Topics: Gaussian Processes and Bayesian Inference · Advanced Bandit Algorithms Research · Simulation Techniques and Applications

Methods: Gaussian Process