Temporal Shift Reinforcement Learning

Deepak George Thomas; Tichakorn Wongpiromsarn; Ali Jannesari

arXiv:2109.02145·cs.LG·October 28, 2021

TL;DR

Temporal Shift Reinforcement Learning (TSRL) jointly learns temporal and spatial features in image-based DRL without adding parameters. It outperforms the frame stacking heuristic on both Atari environments tested and beats the state of the art on one of them.

Contribution

TSRL is a technique that adds temporal learning to image-based DRL function approximators without additional parameters, improving performance over the commonly used frame stacking heuristic.

Findings

01  TSRL outperforms frame stacking on both Atari environments tested.

02  TSRL beats the state of the art on one of the two environments.

03  The method has implications for robotics and sequential decision-making.
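
For reference, the frame stacking baseline named in the findings can be sketched as follows. This is a minimal illustration, not code from the paper or its repository: the class name `FrameStack`, the stack depth `k=4`, and the 84x84 frame size are assumptions borrowed from common DQN setups.

```python
from collections import deque

import numpy as np


class FrameStack:
    """Keep the k most recent frames and expose them as one observation.

    Hypothetical helper for illustration; k=4 is an assumed default.
    """

    def __init__(self, k=4):
        self.k = k
        self.frames = deque(maxlen=k)  # oldest frame is dropped automatically

    def reset(self, frame):
        # At episode start, fill the buffer with copies of the first frame.
        self.frames.clear()
        for _ in range(self.k):
            self.frames.append(frame)
        return self.observation()

    def step(self, frame):
        # Append the newest frame; the deque evicts the oldest one.
        self.frames.append(frame)
        return self.observation()

    def observation(self):
        # Shape (k, H, W): frames become input channels of the network.
        return np.stack(list(self.frames), axis=0)


stacker = FrameStack(k=4)
obs = stacker.reset(np.zeros((84, 84), dtype=np.float32))
obs = stacker.step(np.ones((84, 84), dtype=np.float32))
```

With this heuristic, time enters the network only as extra input channels of the first layer, which is the design that TSRL is compared against.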

Abstract

The function approximators employed by traditional image-based Deep Reinforcement Learning (DRL) algorithms usually lack a temporal learning component and instead focus on learning the spatial component. We propose a technique, Temporal Shift Reinforcement Learning (TSRL), in which the temporal and spatial components are jointly learned. Moreover, TSRL does not require additional parameters to perform temporal learning. We show that TSRL outperforms the commonly used frame stacking heuristic on both of the Atari environments we test on, while beating the SOTA for one of them. This investigation has implications for robotics and sequential decision-making.
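
The abstract does not describe the shift mechanism itself. The sketch below assumes a channel shift along the time axis in the style of the Temporal Shift Module, which the repository name `TSM_RL` hints at. Whether TSRL uses exactly this pattern is an assumption, and the function name `temporal_shift`, the `(T, C, H, W)` layout, and the `fold_div` parameter are hypothetical. It shows how information can move between frames using only indexing, with no learnable weights.

```python
import numpy as np


def temporal_shift(x, fold_div=4):
    """Shift a fraction of channels along the time axis.

    x has shape (T, C, H, W): T consecutive feature maps with C channels.
    The first C // fold_div channels take their values from the next frame,
    the second C // fold_div channels take theirs from the previous frame,
    and the remaining channels are left untouched. Vacated positions are
    zero-filled. fold_div=4 is an assumed value for illustration.
    """
    fold = x.shape[1] // fold_div
    out = np.zeros_like(x)
    # Channels [0, fold): frame t receives features from frame t + 1.
    out[:-1, :fold] = x[1:, :fold]
    # Channels [fold, 2 * fold): frame t receives features from frame t - 1.
    out[1:, fold:2 * fold] = x[:-1, fold:2 * fold]
    # Remaining channels keep their own frame's features.
    out[:, 2 * fold:] = x[:, 2 * fold:]
    return out


# Four frames of 8-channel 2x2 feature maps, filled with distinct values.
x = np.arange(4 * 8 * 2 * 2, dtype=np.float32).reshape(4, 8, 2, 2)
out = temporal_shift(x, fold_div=4)
```

Because the operation is pure data movement, a convolution applied afterwards sees features from neighbouring frames in its input channels, which is consistent with the claim that no additional parameters are needed.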

Code & Models

Repositories

Deepakgthomas/TSM_RL (PyTorch, official)

Taxonomy

Methods: Convolution · Q-Learning · Dense Connections · Deep Q-Network