Repeated Deceptive Path Planning against Learnable Observer
Shiyue Cao, Pei Xu, Likun Yang, Lei Cui, Shizhao Yu, Shiyu Zhang, Yongjian Ren, Xiaotang Chen, Kaiqi Huang

TL;DR
This paper introduces a novel framework for deceptive path planning against learnable observers, enabling an agent to adapt and sustain deception over repeated interactions in dynamic environments.
Contribution
It proposes Deceptive Meta Planning (DeMP), a two-level optimization approach that improves adaptation to evolving adversaries in repeated deception scenarios.
Findings
DeMP significantly outperforms existing methods in repeated deception tasks.
Incorporating cross-episode feedback accelerates observer model adaptation.
DeMP maintains competitive path costs while enhancing deception effectiveness.
Abstract
We study the problem of deceptive path planning (DPP), where an agent aims to conceal its true destination from external observers. While existing work assumes static, non-learning observers, real-world adversaries-such as in critical goods transportation or military operations-can adapt by learning from historical trajectories. To address this gap, we introduce Repeated Deceptive Path Planning (RDPP), a new formulation that explicitly models learnable observers. We show that existing DPP methods fail under this setting, as they cannot adapt to evolving adversarial predictions. While incorporating observer previous predictions into updates enables some adaptation, such incremental updates cause accumulative lag that degrades deception. To this end, we propose Deceptive Meta Planning (DeMP), a two-level optimization framework that combines episode-level adaptation, which enables…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
