Visibility Optimization for Surveillance-Evasion Games

Louis Ly; Yen-Hsi Richard Tsai

arXiv:2010.09001·cs.AI·March 29, 2022

Visibility Optimization for Surveillance-Evasion Games

Louis Ly, Yen-Hsi Richard Tsai

PDF

Open Access

TL;DR

This paper develops computational methods for optimizing visibility in surveillance-evasion games, combining Hamilton-Jacobi equations, local strategies, and deep reinforcement learning to handle complex, multi-agent scenarios.

Contribution

It introduces a novel approach integrating optimal control, local strategies, and deep learning to improve real-time decision-making in surveillance-evasion games.

Findings

01

Deep neural networks can learn effective strategies for complex surveillance-evasion scenarios.

02

Monte Carlo tree search and self-play reinforcement learning enable online strategy generation.

03

The proposed methods scale with increased computational resources and training, improving performance.

Abstract

We consider surveillance-evasion differential games, where a pursuer must try to constantly maintain visibility of a moving evader. The pursuer loses as soon as the evader becomes occluded. Optimal controls for game can be formulated as a Hamilton-Jacobi-Isaac equation. We use an upwind scheme to compute the feedback value function, corresponding to the end-game time of the differential game. Although the value function enables optimal controls, it is prohibitively expensive to compute, even for a single pursuer and single evader on a small grid. We consider a discrete variant of the surveillance-game. We propose two locally optimal strategies based on the static value function for the surveillance-evasion game with multiple pursuers and evaders. We show that Monte Carlo tree search and self-play reinforcement learning can train a deep neural network to generate reasonable strategies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGuidance and Control Systems · Computational Fluid Dynamics and Aerodynamics · Adversarial Robustness in Machine Learning