Challenges of Context and Time in Reinforcement Learning: Introducing   Space Fortress as a Benchmark

Akshat Agarwal; Ryan Hope; Katia Sycara

arXiv:1809.02206·cs.LG·September 10, 2018·5 cites

Challenges of Context and Time in Reinforcement Learning: Introducing Space Fortress as a Benchmark

Akshat Agarwal, Ryan Hope, Katia Sycara

PDF

Open Access 1 Repo

TL;DR

This paper introduces Space Fortress as a new reinforcement learning benchmark that emphasizes the challenges of context shifts and temporal sensitivity, which are often overlooked in existing benchmarks, to better evaluate RL algorithms in real-world scenarios.

Contribution

The paper presents Space Fortress as a novel RL benchmark incorporating context-dependent shifts and temporal sensitivity, and demonstrates the limitations of current algorithms in this environment.

Findings

01

State-of-the-art RL algorithms perform poorly on Space Fortress.

02

Performance issues are linked to insensitivity to context and reward sparsity.

03

Space Fortress can be used to study context and temporal sensitivity in RL.

Abstract

Research in deep reinforcement learning (RL) has coalesced around improving performance on benchmarks like the Arcade Learning Environment. However, these benchmarks conspicuously miss important characteristics like abrupt context-dependent shifts in strategy and temporal sensitivity that are often present in real-world domains. As a result, RL research has not focused on these challenges, resulting in algorithms which do not understand critical changes in context, and have little notion of real world time. To tackle this issue, this paper introduces the game of Space Fortress as a RL benchmark which incorporates these characteristics. We show that existing state-of-the-art RL algorithms are unable to learn to play the Space Fortress game. We then confirm that this poor performance is due to the RL algorithms' context insensitivity and reward sparsity. We also identify independent axes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

agakshat/spacefortress
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Advanced Bandit Algorithms Research