Visibility Optimization for Surveillance-Evasion Games
Louis Ly, Yen-Hsi Richard Tsai

TL;DR
This paper develops computational methods for optimizing visibility in surveillance-evasion games, combining Hamilton-Jacobi equations, local strategies, and deep reinforcement learning to handle complex, multi-agent scenarios.
Contribution
It introduces a novel approach integrating optimal control, local strategies, and deep learning to improve real-time decision-making in surveillance-evasion games.
Findings
Deep neural networks can learn effective strategies for complex surveillance-evasion scenarios.
Monte Carlo tree search and self-play reinforcement learning enable online strategy generation.
The proposed methods scale with increased computational resources and training, improving performance.
Abstract
We consider surveillance-evasion differential games, where a pursuer must try to constantly maintain visibility of a moving evader. The pursuer loses as soon as the evader becomes occluded. Optimal controls for game can be formulated as a Hamilton-Jacobi-Isaac equation. We use an upwind scheme to compute the feedback value function, corresponding to the end-game time of the differential game. Although the value function enables optimal controls, it is prohibitively expensive to compute, even for a single pursuer and single evader on a small grid. We consider a discrete variant of the surveillance-game. We propose two locally optimal strategies based on the static value function for the surveillance-evasion game with multiple pursuers and evaders. We show that Monte Carlo tree search and self-play reinforcement learning can train a deep neural network to generate reasonable strategies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGuidance and Control Systems · Computational Fluid Dynamics and Aerodynamics · Adversarial Robustness in Machine Learning
