Options of Interest: Temporal Abstraction with Interest Functions

Khimya Khetarpal; Martin Klissarov; Maxime Chevalier-Boisvert,; Pierre-Luc Bacon; Doina Precup

arXiv:2001.00271·cs.LG·January 3, 2020

Options of Interest: Temporal Abstraction with Interest Functions

Khimya Khetarpal, Martin Klissarov, Maxime Chevalier-Boisvert,, Pierre-Luc Bacon, Doina Precup

PDF

Open Access 2 Repos

TL;DR

This paper introduces interest functions for temporal abstraction in reinforcement learning, enabling better option discovery and interpretability through a gradient-based learning algorithm and new architecture.

Contribution

It generalizes initiation sets using interest functions, proposes a gradient-based learning algorithm, and develops an interest-option-critic architecture for improved temporal abstraction.

Findings

01

Interest functions improve option learning in discrete and continuous environments.

02

The approach yields interpretable and reusable temporal abstractions.

03

Quantitative and qualitative results demonstrate the method's effectiveness.

Abstract

Temporal abstraction refers to the ability of an agent to use behaviours of controllers which act for a limited, variable amount of time. The options framework describes such behaviours as consisting of a subset of states in which they can initiate, an internal policy and a stochastic termination condition. However, much of the subsequent work on option discovery has ignored the initiation set, because of difficulty in learning it from data. We provide a generalization of initiation sets suitable for general function approximation, by defining an interest function associated with an option. We derive a gradient-based learning algorithm for interest functions, leading to a new interest-option-critic architecture. We investigate how interest functions can be leveraged to learn interpretable and reusable temporal abstractions. We demonstrate the efficacy of the proposed approach through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Control Systems Optimization · Auction Theory and Applications