Neural architecture impact on identifying temporally extended   Reinforcement Learning tasks

Victor Vadakechirayath George

arXiv:2310.03161·cs.LG·October 6, 2023

Neural architecture impact on identifying temporally extended Reinforcement Learning tasks

Victor Vadakechirayath George

PDF

Open Access

TL;DR

This paper explores attention-based neural architectures, including Vision Transformers, for reinforcement learning in Atari games, enhancing interpretability and efficiency compared to previous models.

Contribution

Introduces attention-based models, including a Vision Transformer architecture, for RL that improve interpretability and training efficiency in Atari environments.

Findings

01

Attention models enable visualization of agent perception.

02

Vision Transformer-based RL models train faster and use fewer resources.

03

Models perform well on Atari-2600 games.

Abstract

Inspired by recent developments in attention models for image classification and natural language processing, we present various Attention based architectures in reinforcement learning (RL) domain, capable of performing well on OpenAI Gym Atari-2600 game suite. In spite of the recent success of Deep Reinforcement learning techniques in various fields like robotics, gaming and healthcare, they suffer from a major drawback that neural networks are difficult to interpret. We try to get around this problem with the help of Attention based models. In Attention based models, extracting and overlaying of attention map onto images allows for direct observation of information used by agent to select actions and easier interpretation of logic behind the chosen actions. Our models in addition to playing well on gym-Atari environments, also provide insights on how agent perceives its environment.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsMulti-Head Attention · Attention Is All You Need · Dropout · Dense Connections · Vision Transformer · Linear Layer · Label Smoothing · Adam · Absolute Position Encodings · Residual Connection