Action Q-Transformer: Visual Explanation in Deep Reinforcement Learning   with Encoder-Decoder Model using Action Query

Hidenori Itaya; Tsubasa Hirakawa; Takayoshi Yamashita; Hironobu; Fujiyoshi; Komei Sugiura

arXiv:2306.13879·cs.LG·June 27, 2023

Action Q-Transformer: Visual Explanation in Deep Reinforcement Learning with Encoder-Decoder Model using Action Query

Hidenori Itaya, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu, Fujiyoshi, Komei Sugiura

PDF

Open Access

TL;DR

The paper introduces the Action Q-Transformer, a novel encoder-decoder transformer model for deep reinforcement learning that enhances interpretability through attention visualization while maintaining or improving performance.

Contribution

It proposes a transformer-based DRL model with attention visualization using action queries, improving interpretability without sacrificing performance.

Findings

01

Attention visualization reveals decision-making processes.

02

Achieves higher performance than baseline in some Atari games.

03

Provides detailed analysis of agent behavior in complex tasks.

Abstract

The excellent performance of Transformer in supervised learning has led to growing interest in its potential application to deep reinforcement learning (DRL) to achieve high performance on a wide variety of problems. However, the decision making of a DRL agent is a black box, which greatly hinders the application of the agent to real-world problems. To address this problem, we propose the Action Q-Transformer (AQT), which introduces a transformer encoder-decoder structure to Q-learning based DRL methods. In AQT, the encoder calculates the state value function and the decoder calculates the advantage function to promote the acquisition of different attentions indicating the agent's decision-making. The decoder in AQT utilizes action queries, which represent the information of each action, as queries. This enables us to obtain the attentions for the state value and for each action. By…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI)

MethodsMulti-Head Attention · Attention Is All You Need · Absolute Position Encodings · Linear Layer · Layer Normalization · Position-Wise Feed-Forward Layer · Dense Connections · Label Smoothing · Adam · Byte Pair Encoding