Deceptive Sequential Decision-Making via Regularized Policy Optimization

Yerin Kim; Alexander Benvenuti; Bo Chen; Mustafa Karabag; Abhishek Kulkarni; Nathaniel D. Bastian; Ufuk Topcu; Matthew Hale

arXiv:2501.18803·cs.LG·January 22, 2026

Deceptive Sequential Decision-Making via Regularized Policy Optimization

Yerin Kim, Alexander Benvenuti, Bo Chen, Mustafa Karabag, Abhishek Kulkarni, Nathaniel D. Bastian, Ufuk Topcu, Matthew Hale

PDF

Open Access

TL;DR

This paper introduces a framework for autonomous systems to deceive adversaries by actively misrepresenting their reward functions through three novel regularization strategies, maintaining high performance while misleading observers.

Contribution

It proposes three new deception strategies—diversionary, targeted, and equivocal—for policy optimization in Markov decision processes to mislead adversaries using inverse reinforcement learning.

Findings

01

All deception strategies successfully mislead adversaries.

02

Deceptive policies retain at least 98% of optimal reward.

03

The framework is validated in multi-agent scenarios.

Abstract

Autonomous systems are increasingly expected to operate in the presence of adversaries, though adversaries may infer sensitive information simply by observing a system. Therefore, present a deceptive sequential decision-making framework that not only conceals sensitive information, but actively misleads adversaries about it. We model autonomous systems as Markov decision processes, with adversaries using inverse reinforcement learning to recover reward functions. To counter them, we present three regularization strategies for policy synthesis problems that actively deceive an adversary about a system's reward. ``Diversionary deception'' leads an adversary to draw any false conclusion about the system's reward function. ``Targeted deception'' leads an adversary to draw a specific false conclusion about the system's reward function. ``Equivocal deception'' leads an adversary to infer that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Distributed Sensor Networks and Detection Algorithms · Game Theory and Applications