BARReL: Bottleneck Attention for Adversarial Robustness in Vision-Based Reinforcement Learning
Eugene Bykovets, Yannick Metz, Mennatallah El-Assady, Daniel A. Keim,, Joachim M. Buhmann

TL;DR
This paper explores how Bottleneck Attention Modules (BAM) can enhance the robustness of vision-based reinforcement learning agents against adversarial attacks by focusing on salient regions.
Contribution
It introduces the use of BAM in CNN architectures for improved adversarial robustness in vision-based RL, demonstrating increased resilience during inference.
Findings
BAM improves robustness against gradient-based adversarial attacks.
Attention maps help recover salient activations, enhancing decision reliability.
BAM-enhanced models perform better across multiple RL environments.
Abstract
Robustness to adversarial perturbations has been explored in many areas of computer vision. This robustness is particularly relevant in vision-based reinforcement learning, as the actions of autonomous agents might be safety-critic or impactful in the real world. We investigate the susceptibility of vision-based reinforcement learning agents to gradient-based adversarial attacks and evaluate a potential defense. We observe that Bottleneck Attention Modules (BAM) included in CNN architectures can act as potential tools to increase robustness against adversarial attacks. We show how learned attention maps can be used to recover activations of a convolutional layer by restricting the spatial activations to salient regions. Across a number of RL environments, BAM-enhanced architectures show increased robustness during inference. Finally, we discuss potential future research directions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
