Visual Explanation using Attention Mechanism in Actor-Critic-based Deep Reinforcement Learning
Hidenori Itaya, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu, Fujiyoshi, Komei Sugiura

TL;DR
This paper introduces Mask-Attention A3C, an attention mechanism integrated into deep reinforcement learning that visualizes decision-making and improves performance in game environments.
Contribution
It proposes a novel attention mechanism for actor-critic DRL that enables visualization of decision reasons and enhances agent performance.
Findings
Attention maps effectively visualize decision reasons.
The method improves agent performance in Atari games.
Attention mechanism maintains or enhances learning efficiency.
Abstract
Deep reinforcement learning (DRL) has great potential for acquiring the optimal action in complex environments such as games and robot control. However, it is difficult to analyze the decision-making of the agent, i.e., the reasons it selects the action acquired by learning. In this work, we propose Mask-Attention A3C (Mask A3C), which introduces an attention mechanism into Asynchronous Advantage Actor-Critic (A3C), which is an actor-critic-based DRL method, and can analyze the decision-making of an agent in DRL. A3C consists of a feature extractor that extracts features from an image, a policy branch that outputs the policy, and a value branch that outputs the state value. In this method, we focus on the policy and value branches and introduce an attention mechanism into them. The attention mechanism applies a mask processing to the feature maps of each branch using mask-attention that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning
MethodsEntropy Regularization · Convolution · Dense Connections · Softmax · A3C
