Real-time Adversarial Perturbations against Deep Reinforcement Learning   Policies: Attacks and Defenses

Buse G. A. Tekgul; Shelly Wang; Samuel Marchal; N. Asokan

arXiv:2106.08746·cs.LG·September 26, 2022·1 cites

Real-time Adversarial Perturbations against Deep Reinforcement Learning Policies: Attacks and Defenses

Buse G. A. Tekgul, Shelly Wang, Samuel Marchal, N. Asokan

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that universal adversarial perturbations can effectively and rapidly fool deep reinforcement learning policies in real time, revealing vulnerabilities and proposing detection defenses.

Contribution

It introduces three real-time attack variants using universal adversarial perturbations against DRL, and proposes a detection method for such attacks.

Findings

01

Universal perturbations fully degrade DRL performance (up to 100%)

02

Attacks operate faster than 60Hz frame rate (~1.8ms)

03

Detection method effectively identifies all tested adversarial perturbations

Abstract

Deep reinforcement learning (DRL) is vulnerable to adversarial perturbations. Adversaries can mislead the policies of DRL agents by perturbing the state of the environment observed by the agents. Existing attacks are feasible in principle, but face challenges in practice, either by being too slow to fool DRL policies in real time or by modifying past observations stored in the agent's memory. We show that Universal Adversarial Perturbations (UAP), independent of the individual inputs to which they are applied, can fool DRL policies effectively and in real time. We introduce three attack variants leveraging UAP. Via an extensive evaluation using three Atari 2600 games, we show that our attacks are effective, as they fully degrade the performance of three different DRL agents (up to 100%, even when the $l_{\infty}$ bound on the perturbation is as small as 0.01). It is faster than the frame…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ssg-research/ad3-action-distribution-divergence-detector
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Cardiac Arrest and Resuscitation