Bootstrapping Skills

Daniel J. Mankowitz; Timothy A. Mann; Shie Mannor

arXiv:1506.03624·cs.AI·June 12, 2015

Bootstrapping Skills

Daniel J. Mankowitz, Timothy A. Mann, Shie Mannor

PDF

Open Access

TL;DR

This paper introduces Learning Skills via Bootstrapping (LSB), a method that iteratively learns and combines simple, parameterized skills to efficiently solve complex MDPs, outperforming monolithic policy approaches.

Contribution

The paper proposes a novel bootstrapping approach for learning and combining simple skills, enabling effective policy learning in large, complex MDPs.

Findings

01

LSB can learn near-optimal policies through iterative skill bootstrapping.

02

LSB outperforms monolithic policy methods on complex MDPs.

03

The approach is compatible with various RL algorithms.

Abstract

The monolithic approach to policy representation in Markov Decision Processes (MDPs) looks for a single policy that can be represented as a function from states to actions. For the monolithic approach to succeed (and this is not always possible), a complex feature representation is often necessary since the policy is a complex object that has to prescribe what actions to take all over the state space. This is especially true in large domains with complicated dynamics. It is also computationally inefficient to both learn and plan in MDPs using a complex monolithic approach. We present a different approach where we restrict the policy space to policies that can be represented as combinations of simpler, parameterized skills---a type of temporally extended action, with a simple policy representation. We introduce Learning Skills via Bootstrapping (LSB) that can use a broad family of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Robot Manipulation and Learning