Towards Robust Policy: Enhancing Offline Reinforcement Learning with   Adversarial Attacks and Defenses

Thanh Nguyen; Tung M. Luu; Tri Ton; and Chang D. Yoo

arXiv:2405.11206·cs.LG·August 13, 2024·1 cites

Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses

Thanh Nguyen, Tung M. Luu, Tri Ton, and Chang D. Yoo

PDF

Open Access

TL;DR

This paper proposes a framework that improves the robustness of offline reinforcement learning models by applying adversarial attacks and defenses, demonstrating increased resilience against observation perturbations on benchmark datasets.

Contribution

It introduces a novel framework combining multiple adversarial attacks and defenses to enhance offline RL policy robustness, which is less explored in prior work.

Findings

01

Attacks significantly degrade policy performance.

02

Defenses effectively improve robustness.

03

Framework enhances reliability in practical scenarios.

Abstract

Offline reinforcement learning (RL) addresses the challenge of expensive and high-risk data exploration inherent in RL by pre-training policies on vast amounts of offline data, enabling direct deployment or fine-tuning in real-world environments. However, this training paradigm can compromise policy robustness, leading to degraded performance in practical conditions due to observation perturbations or intentional attacks. While adversarial attacks and defenses have been extensively studied in deep learning, their application in offline RL is limited. This paper proposes a framework to enhance the robustness of offline RL models by leveraging advanced adversarial attacks and defenses. The framework attacks the actor and critic components by perturbing observations during training and using adversarial defenses as regularization to enhance the learned policy. Four attacks and two defenses…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Network Security and Intrusion Detection