A Generative Machine Learning Approach to Policy Optimization in   Pursuit-Evasion Games

Shiva Navabi; Osonde A. Osoba

arXiv:2010.01711·cs.LG·October 15, 2020

A Generative Machine Learning Approach to Policy Optimization in Pursuit-Evasion Games

Shiva Navabi, Osonde A. Osoba

PDF

TL;DR

This paper explores using generative machine learning models to optimize pursuit strategies in a pursuit-evasion game, addressing the challenge of planning under uncertainty without explicit dynamic programming solutions.

Contribution

It introduces a data-driven, generative ML approach to learn pursuit policies in pursuit-evasion games where traditional methods are intractable.

Findings

01

Generative ML models effectively learn pursuit policies.

02

The approach captures implicit dynamics of pursuit-evasion interactions.

03

Statistical assessments validate the modeling effectiveness.

Abstract

We consider a pursuit-evasion game [11] played between two agents, 'Blue' (the pursuer) and 'Red' (the evader), over $T$ time steps. Red aims to attack Blue's territory. Blue's objective is to intercept Red by time $T$ and thereby limit the success of Red's attack. Blue must plan its pursuit trajectory by choosing parameters that determine its course of movement (speed and angle in our setup) such that it intercepts Red by time $T$ . We show that Blue's path-planning problem in pursuing Red, can be posed as a sequential decision making problem under uncertainty. Blue's unawareness of Red's action policy renders the analytic dynamic programming approach intractable for finding the optimal action policy for Blue. In this work, we are interested in exploring data-driven approaches to the policy optimization problem that Blue faces. We apply generative machine learning (ML) approaches to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.