Minimalistic Attacks: How Little it Takes to Fool a Deep Reinforcement Learning Policy
Xinghua Qu, Zhu Sun, Yew-Soon Ong, Abhishek Gupta, Pengfei Wei

TL;DR
This paper investigates the vulnerability of deep reinforcement learning policies to minimalistic adversarial attacks, demonstrating that very small perturbations or limited frame modifications can significantly degrade policy performance.
Contribution
It introduces a new framework for minimalistic adversarial attacks in RL, focusing on black-box access, fractional pixel perturbations, and tactical frame selection, revealing their effectiveness.
Findings
0.01% input state modification degrades performance
DQN policy is deceived by 1% frame perturbation
Minimal attacks can significantly fool RL policies
Abstract
Recent studies have revealed that neural network-based policies can be easily fooled by adversarial examples. However, while most prior works analyze the effects of perturbing every pixel of every frame assuming white-box policy access, in this paper we take a more restrictive view towards adversary generation - with the goal of unveiling the limits of a model's vulnerability. In particular, we explore minimalistic attacks by defining three key settings: (1) black-box policy access: where the attacker only has access to the input (state) and output (action probability) of an RL policy; (2) fractional-state adversary: where only several pixels are perturbed, with the extreme case being a single-pixel adversary; and (3) tactically-chanced attack: where only significant frames are tactically chosen to be attacked. We formulate the adversarial attack by accommodating the three key settings…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Reinforcement Learning in Robotics · Advanced Neural Network Applications
MethodsQ-Learning · Dense Connections · Convolution · Deep Q-Network
