Temporal Abstraction in Reinforcement Learning with the Successor   Representation

Marlos C. Machado; Andre Barreto; Doina Precup; Michael; Bowling

arXiv:2110.05740·cs.LG·April 13, 2023·1 cites

Temporal Abstraction in Reinforcement Learning with the Successor Representation

Marlos C. Machado, Andre Barreto, Doina Precup, Michael, Bowling

PDF

Open Access

TL;DR

This paper explores how the successor representation can be used to discover and refine options in reinforcement learning, enabling better temporal abstraction, exploration, and planning.

Contribution

It introduces a framework where successor representation facilitates the discovery and combination of options, creating a cycle of continuous improvement for both options and representations.

Findings

01

Successor representation aids in discovering useful options for exploration and planning.

02

Combining learned options via SR enhances exploration without additional learning.

03

Empirical results demonstrate the effectiveness of SR-based options in various RL tasks.

Abstract

Reasoning at multiple levels of temporal abstraction is one of the key attributes of intelligence. In reinforcement learning, this is often modeled through temporally extended courses of actions called options. Options allow agents to make predictions and to operate at different levels of abstraction within an environment. Nevertheless, approaches based on the options framework often start with the assumption that a reasonable set of options is known beforehand. When this is not the case, there are no definitive answers for which options one should consider. In this paper, we argue that the successor representation (SR), which encodes states based on the pattern of state visitation that follows them, can be seen as a natural substrate for the discovery and use of temporal abstractions. To support our claim, we take a big picture view of recent results, showing how the SR can be used to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Computability, Logic, AI Algorithms · Data Stream Mining Techniques