Temporal Abstraction in Reinforcement Learning with the Successor Representation
Marlos C. Machado, Andre Barreto, Doina Precup, Michael, Bowling

TL;DR
This paper explores how the successor representation can be used to discover and refine options in reinforcement learning, enabling better temporal abstraction, exploration, and planning.
Contribution
It introduces a framework where successor representation facilitates the discovery and combination of options, creating a cycle of continuous improvement for both options and representations.
Findings
Successor representation aids in discovering useful options for exploration and planning.
Combining learned options via SR enhances exploration without additional learning.
Empirical results demonstrate the effectiveness of SR-based options in various RL tasks.
Abstract
Reasoning at multiple levels of temporal abstraction is one of the key attributes of intelligence. In reinforcement learning, this is often modeled through temporally extended courses of actions called options. Options allow agents to make predictions and to operate at different levels of abstraction within an environment. Nevertheless, approaches based on the options framework often start with the assumption that a reasonable set of options is known beforehand. When this is not the case, there are no definitive answers for which options one should consider. In this paper, we argue that the successor representation (SR), which encodes states based on the pattern of state visitation that follows them, can be seen as a natural substrate for the discovery and use of temporal abstractions. To support our claim, we take a big picture view of recent results, showing how the SR can be used to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Computability, Logic, AI Algorithms · Data Stream Mining Techniques
