Evader-Agnostic Team-Based Pursuit Strategies in Partially-Observable Environments

Addison Kalanther; Daniel Bostwick; Chinmay Maheshwari; Shankar Sastry

arXiv:2511.05812·cs.MA·November 13, 2025

Evader-Agnostic Team-Based Pursuit Strategies in Partially-Observable Environments

Addison Kalanther, Daniel Bostwick, Chinmay Maheshwari, Shankar Sastry

PDF

Open Access

TL;DR

This paper introduces a neuro-symbolic, two-phase pursuit strategy for UAV teams in urban environments, combining offline deep reinforcement learning with online opponent classification to improve pursuit success against unknown evaders.

Contribution

It presents a novel two-phase approach integrating deep reinforcement learning and opponent classification for pursuit-evasion in partially observable environments.

Findings

01

Improved average pursuit success against random evaders.

02

Effective two-phase strategy combining offline training and online adaptation.

03

Demonstrated applicability in urban UAV pursuit scenarios.

Abstract

We consider a scenario where a team of two unmanned aerial vehicles (UAVs) pursue an evader UAV within an urban environment. Each agent has a limited view of their environment where buildings can occlude their field-of-view. Additionally, the pursuer team is agnostic about the evader in terms of its initial and final location, and the behavior of the evader. Consequently, the team needs to gather information by searching the environment and then track it to eventually intercept. To solve this multi-player, partially-observable, pursuit-evasion game, we develop a two-phase neuro-symbolic algorithm centered around the principle of bounded rationality. First, we devise an offline approach using deep reinforcement learning to progressively train adversarial policies for the pursuer team against fictitious evaders. This creates $k$ -levels of rationality for each agent in preparation for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGuidance and Control Systems · Adaptive Dynamic Programming Control · Military Defense Systems Analysis