A Generative Machine Learning Approach to Policy Optimization in Pursuit-Evasion Games
Shiva Navabi, Osonde A. Osoba

TL;DR
This paper explores using generative machine learning models to optimize pursuit strategies in a pursuit-evasion game, addressing the challenge of planning under uncertainty without explicit dynamic programming solutions.
Contribution
It introduces a data-driven, generative ML approach to learn pursuit policies in pursuit-evasion games where traditional methods are intractable.
Findings
Generative ML models effectively learn pursuit policies.
The approach captures implicit dynamics of pursuit-evasion interactions.
Statistical assessments validate the modeling effectiveness.
Abstract
We consider a pursuit-evasion game [11] played between two agents, 'Blue' (the pursuer) and 'Red' (the evader), over time steps. Red aims to attack Blue's territory. Blue's objective is to intercept Red by time and thereby limit the success of Red's attack. Blue must plan its pursuit trajectory by choosing parameters that determine its course of movement (speed and angle in our setup) such that it intercepts Red by time . We show that Blue's path-planning problem in pursuing Red, can be posed as a sequential decision making problem under uncertainty. Blue's unawareness of Red's action policy renders the analytic dynamic programming approach intractable for finding the optimal action policy for Blue. In this work, we are interested in exploring data-driven approaches to the policy optimization problem that Blue faces. We apply generative machine learning (ML) approaches to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
