Minimalistic Attacks: How Little it Takes to Fool a Deep Reinforcement   Learning Policy

Xinghua Qu; Zhu Sun; Yew-Soon Ong; Abhishek Gupta; Pengfei Wei

arXiv:1911.03849·cs.LG·October 30, 2020·1 cites

Minimalistic Attacks: How Little it Takes to Fool a Deep Reinforcement Learning Policy

Xinghua Qu, Zhu Sun, Yew-Soon Ong, Abhishek Gupta, Pengfei Wei

PDF

Open Access

TL;DR

This paper investigates the vulnerability of deep reinforcement learning policies to minimalistic adversarial attacks, demonstrating that very small perturbations or limited frame modifications can significantly degrade policy performance.

Contribution

It introduces a new framework for minimalistic adversarial attacks in RL, focusing on black-box access, fractional pixel perturbations, and tactical frame selection, revealing their effectiveness.

Findings

01

0.01% input state modification degrades performance

02

DQN policy is deceived by 1% frame perturbation

03

Minimal attacks can significantly fool RL policies

Abstract

Recent studies have revealed that neural network-based policies can be easily fooled by adversarial examples. However, while most prior works analyze the effects of perturbing every pixel of every frame assuming white-box policy access, in this paper we take a more restrictive view towards adversary generation - with the goal of unveiling the limits of a model's vulnerability. In particular, we explore minimalistic attacks by defining three key settings: (1) black-box policy access: where the attacker only has access to the input (state) and output (action probability) of an RL policy; (2) fractional-state adversary: where only several pixels are perturbed, with the extreme case being a single-pixel adversary; and (3) tactically-chanced attack: where only significant frames are tactically chosen to be attacked. We formulate the adversarial attack by accommodating the three key settings…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Reinforcement Learning in Robotics · Advanced Neural Network Applications

MethodsQ-Learning · Dense Connections · Convolution · Deep Q-Network