Towards Optimal Adversarial Robust Reinforcement Learning with Infinity Measurement Error

Haoran Li; Zicheng Zhang; Wang Luo; Congying Han; Jiayu Lv; Tiande Guo; Yudong Hu

arXiv:2502.16734 · cs.LG · February 25, 2025

TL;DR

This paper introduces the Intrinsic State-adversarial Markov Decision Process (ISA-MDP), a formulation of decision-making under state-adversarial attacks, proves that an optimal robust policy (ORP) exists within it, and develops the CAR-RL framework to improve the adversarial robustness of deep reinforcement learning (DRL).

Contribution

The paper proposes ISA-MDP to model adversarial robustness, proves the existence of an optimal robust policy, and introduces CAR-RL to improve robustness by optimizing for infinity measurement error.
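As a rough intuition for why a worst-case ("infinity") error measure can matter, the sketch below compares the mean and the maximum Bellman error of a tabular Q-function. It is a toy illustration under stated assumptions, not the paper's CAR-RL method: the two-state MDP, the perturbation size, and every function name are hypothetical, and reading "infinity measurement error" as the sup-norm of the Bellman error is an interpretation of this summary.

```python
# Toy illustration only: this 2-state, 2-action MDP is hypothetical, not from the paper.
GAMMA = 0.9

# Deterministic transitions NEXT[s][a] and rewards REWARD[s][a] (assumed values).
NEXT = {0: {0: 0, 1: 1}, 1: {0: 0, 1: 1}}
REWARD = {0: {0: 0.0, 1: 1.0}, 1: {0: 0.0, 1: 2.0}}


def bellman_backup(q):
    """Apply the Bellman optimality operator to a tabular Q-function."""
    return {
        s: {a: REWARD[s][a] + GAMMA * max(q[NEXT[s][a]].values()) for a in q[s]}
        for s in q
    }


def bellman_errors(q):
    """Absolute Bellman residual for every (state, action) pair."""
    tq = bellman_backup(q)
    return [abs(tq[s][a] - q[s][a]) for s in q for a in q[s]]


def mean_error(q):
    """Average residual (an L1-style measurement)."""
    errs = bellman_errors(q)
    return sum(errs) / len(errs)


def linf_error(q):
    """Largest residual (the sup-norm, or infinity, measurement)."""
    return max(bellman_errors(q))


def greedy(q):
    """Greedy action in each state."""
    return {s: max(q[s], key=q[s].get) for s in q}


def value_iteration(tol=1e-10):
    """Iterate the backup until the sup-norm residual falls below tol."""
    q = {s: {a: 0.0 for a in NEXT[s]} for s in NEXT}
    while linf_error(q) > tol:
        q = bellman_backup(q)
    return q


q_star = value_iteration()

# Corrupt a single entry enough to flip the greedy action in state 0.
q_bad = {s: dict(acts) for s, acts in q_star.items()}
q_bad[0][0] += 3.0

print("optimal:  ", greedy(q_star), linf_error(q_star))
print("perturbed:", greedy(q_bad), mean_error(q_bad), linf_error(q_bad))
```

In this toy case a single corrupted entry changes the greedy action in state 0, while averaging over all state-action pairs dilutes that residual; the maximum does not. Whether and how this carries over to deep RL training is exactly what the paper investigates, so the sketch should be read as motivation only.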

Findings

1. Existence of deterministic and stationary ORP in ISA-MDP.
2. Improving DRL robustness does not necessarily reduce natural environment performance.
3. CAR-RL achieves superior robustness in value-based and policy-based DRL algorithms.

Abstract

Ensuring the robustness of deep reinforcement learning (DRL) agents against adversarial attacks is critical for their trustworthy deployment. Recent research highlights the challenges of achieving state-adversarial robustness and suggests that an optimal robust policy (ORP) does not always exist, complicating the enforcement of strict robustness constraints. In this paper, we further explore the concept of ORP. We first introduce the Intrinsic State-adversarial Markov Decision Process (ISA-MDP), a novel formulation where adversaries cannot fundamentally alter the intrinsic nature of state observations. ISA-MDP, supported by empirical and theoretical evidence, universally characterizes decision-making under state-adversarial paradigms. We rigorously prove that within ISA-MDP, a deterministic and stationary ORP exists, aligning with the Bellman optimal policy. Our findings theoretically…

Taxonomy

Topics: Adversarial Robustness in Machine Learning