Real-time Adversarial Perturbations against Deep Reinforcement Learning Policies: Attacks and Defenses
Buse G. A. Tekgul, Shelly Wang, Samuel Marchal, N. Asokan

TL;DR
This paper demonstrates that universal adversarial perturbations can effectively and rapidly fool deep reinforcement learning policies in real time, revealing vulnerabilities and proposing detection defenses.
Contribution
It introduces three real-time attack variants using universal adversarial perturbations against DRL, and proposes a detection method for such attacks.
Findings
Universal perturbations fully degrade DRL performance (up to 100%)
Attacks operate faster than 60Hz frame rate (~1.8ms)
Detection method effectively identifies all tested adversarial perturbations
Abstract
Deep reinforcement learning (DRL) is vulnerable to adversarial perturbations. Adversaries can mislead the policies of DRL agents by perturbing the state of the environment observed by the agents. Existing attacks are feasible in principle, but face challenges in practice, either by being too slow to fool DRL policies in real time or by modifying past observations stored in the agent's memory. We show that Universal Adversarial Perturbations (UAP), independent of the individual inputs to which they are applied, can fool DRL policies effectively and in real time. We introduce three attack variants leveraging UAP. Via an extensive evaluation using three Atari 2600 games, we show that our attacks are effective, as they fully degrade the performance of three different DRL agents (up to 100%, even when the bound on the perturbation is as small as 0.01). It is faster than the frame…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Cardiac Arrest and Resuscitation
