Bayesian Ambiguity Contraction-based Adaptive Robust Markov Decision Processes for Adversarial Surveillance Missions
Jimin Choi, Max Z. Li

TL;DR
This paper develops an adaptive robust Markov decision process framework for autonomous surveillance missions with combat aircraft, enabling real-time policy refinement against strategic adversaries while ensuring safety and improving mission success.
Contribution
It introduces a novel adaptive RMDP approach that refines threat models over time, allowing autonomous agents to transition from conservative to aggressive strategies in contested environments.
Findings
Higher mission rewards compared to static planners
Fewer exposure events in simulated adversarial scenarios
Convergence guarantees as threat models contract
Abstract
Collaborative Combat Aircraft (CCAs) are envisioned to enable autonomous Intelligence, Surveillance, and Reconnaissance (ISR) missions in contested environments, where adversaries may act strategically to deceive or evade detection. These missions pose challenges due to model uncertainty and the need for safe, real-time decision-making. Robust Markov Decision Processes (RMDPs) provide worst-case guarantees but are limited by static ambiguity sets that capture initial uncertainty without adapting to new observations. This paper presents an adaptive RMDP framework tailored to ISR missions with CCAs. We introduce a mission-specific formulation in which aircraft alternate between movement and sensing states. Adversarial tactics are modeled as a finite set of transition kernels, each capturing assumptions about how adversarial sensing or environmental conditions affect rewards. Our approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Military Defense Systems Analysis · Reinforcement Learning in Robotics
