Options Discovery with Budgeted Reinforcement Learning

Aur\'elia L\'eon; Ludovic Denoyer

arXiv:1611.06824·cs.LG·February 23, 2017·1 cites

Options Discovery with Budgeted Reinforcement Learning

Aur\'elia L\'eon, Ludovic Denoyer

PDF

Open Access

TL;DR

This paper introduces BONN, a novel neural network model that automatically discovers hierarchical options in reinforcement learning by optimizing a budgeted learning objective, improving policy efficiency.

Contribution

The paper presents BONN, a new model for automatic option discovery in RL that does not rely on predefined options, advancing hierarchical policy learning.

Findings

01

BONN effectively discovers meaningful options in classical RL tasks.

02

The model achieves both quantitative improvements and qualitative insights.

03

Results demonstrate the potential of budgeted learning for hierarchical policy discovery.

Abstract

We consider the problem of learning hierarchical policies for Reinforcement Learning able to discover options, an option corresponding to a sub-policy over a set of primitive actions. Different models have been proposed during the last decade that usually rely on a predefined set of options. We specifically address the problem of automatically discovering options in decision processes. We describe a new learning model called Budgeted Option Neural Network (BONN) able to discover options based on a budgeted learning objective. The BONN model is evaluated on different classical RL problems, demonstrating both quantitative and qualitative interesting results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Data Stream Mining Techniques