Option Transfer and SMDP Abstraction with Successor Features

Dongge Han; Sebastian Tschiatschek

arXiv:2110.09196·cs.LG·June 9, 2022

Option Transfer and SMDP Abstraction with Successor Features

Dongge Han, Sebastian Tschiatschek

PDF

Open Access

TL;DR

This paper introduces a successor feature-based abstraction scheme in reinforcement learning that enables transfer of options across environments and improves planning efficiency by jointly considering state and temporal abstractions.

Contribution

We propose a novel successor feature-based abstraction method that facilitates transfer of options and state aggregation in reinforcement learning.

Findings

01

Effective transfer of options across different environments.

02

Improved planning efficiency with transferred options.

03

Joint state and temporal abstraction enhances generalisation.

Abstract

Abstraction plays an important role in the generalisation of knowledge and skills and is key to sample efficient learning. In this work, we study joint temporal and state abstraction in reinforcement learning, where temporally-extended actions in the form of options induce temporal abstractions, while aggregation of similar states with respect to abstract options induces state abstractions. Many existing abstraction schemes ignore the interplay of state and temporal abstraction. Consequently, the considered option policies often cannot be directly transferred to new environments due to changes in the state space and transition dynamics. To address this issue, we propose a novel abstraction scheme building on successor features. This includes an algorithm for transferring abstract options across different environments and a state abstraction mechanism that allows us to perform efficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Multi-Agent Systems and Negotiation