Mitigating Adversarial Perturbations for Deep Reinforcement Learning via   Vector Quantization

Tung M. Luu; Thanh Nguyen; Tee Joshua Tian Jin; Sungwoon Kim; and; Chang D. Yoo

arXiv:2410.03376·cs.LG·October 7, 2024

Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization

Tung M. Luu, Thanh Nguyen, Tee Joshua Tian Jin, Sungwoon Kim, and, Chang D. Yoo

PDF

Open Access 1 Repo

TL;DR

This paper introduces a vector quantization-based input transformation method to improve the robustness of deep reinforcement learning agents against adversarial perturbations, offering an efficient and integrable defense mechanism.

Contribution

The paper proposes a novel input transformation using vector quantization to defend RL agents from adversarial attacks, complementing existing adversarial training methods.

Findings

01

VQ-based transformation reduces attack success rate.

02

Method is computationally efficient and easy to integrate.

03

Significant robustness improvement demonstrated across multiple environments.

Abstract

Recent studies reveal that well-performing reinforcement learning (RL) agents in training often lack resilience against adversarial perturbations during deployment. This highlights the importance of building a robust agent before deploying it in the real world. Most prior works focus on developing robust training-based procedures to tackle this problem, including enhancing the robustness of the deep neural network component itself or adversarially training the agent on strong attacks. In this work, we instead study an input transformation-based defense for RL. Specifically, we propose using a variant of vector quantization (VQ) as a transformation for input observations, which is then used to reduce the space of adversarial attacks during testing, resulting in the transformed observations being less affected by attacks. Our method is computationally efficient and seamlessly integrates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tunglm2203/vq_robust_rl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning

MethodsFocus