BARReL: Bottleneck Attention for Adversarial Robustness in Vision-Based   Reinforcement Learning

Eugene Bykovets; Yannick Metz; Mennatallah El-Assady; Daniel A. Keim,; Joachim M. Buhmann

arXiv:2208.10481·cs.LG·August 23, 2022

BARReL: Bottleneck Attention for Adversarial Robustness in Vision-Based Reinforcement Learning

Eugene Bykovets, Yannick Metz, Mennatallah El-Assady, Daniel A. Keim,, Joachim M. Buhmann

PDF

Open Access

TL;DR

This paper explores how Bottleneck Attention Modules (BAM) can enhance the robustness of vision-based reinforcement learning agents against adversarial attacks by focusing on salient regions.

Contribution

It introduces the use of BAM in CNN architectures for improved adversarial robustness in vision-based RL, demonstrating increased resilience during inference.

Findings

01

BAM improves robustness against gradient-based adversarial attacks.

02

Attention maps help recover salient activations, enhancing decision reliability.

03

BAM-enhanced models perform better across multiple RL environments.

Abstract

Robustness to adversarial perturbations has been explored in many areas of computer vision. This robustness is particularly relevant in vision-based reinforcement learning, as the actions of autonomous agents might be safety-critic or impactful in the real world. We investigate the susceptibility of vision-based reinforcement learning agents to gradient-based adversarial attacks and evaluate a potential defense. We observe that Bottleneck Attention Modules (BAM) included in CNN architectures can act as potential tools to increase robustness against adversarial attacks. We show how learned attention maps can be used to recover activations of a convolutional layer by restricting the spatial activations to salient regions. Across a number of RL environments, BAM-enhanced architectures show increased robustness during inference. Finally, we discuss potential future research directions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning