Adaptive Skills, Adaptive Partitions (ASAP)

Daniel J. Mankowitz; Timothy A. Mann; Shie Mannor

arXiv:1602.03351 · cs.LG · June 8, 2016 · 22 citations

TL;DR

The ASAP framework learns skills and where to apply them at the same time, so an agent can solve related new tasks with less experience than learning each one from scratch.

Contribution

The paper introduces the ASAP framework, which learns both skills (temporally extended actions, or options) and the regions where each skill applies. The authors present this as a key building block for scaling up to lifelong learning agents.

Findings

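01 ASAP provably converges to a local optimum under natural conditions.

02 ASAP learns where to reuse its existing skills, and solves related new tasks by adapting where those skills are applied.

03 In experiments that include a RoboCup domain, ASAP solves multiple tasks with considerably less experience than solving each task from scratch.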

Abstract

We introduce the Adaptive Skills, Adaptive Partitions (ASAP) framework that (1) learns skills (i.e., temporally extended actions or options) as well as (2) where to apply them. We believe that both (1) and (2) are necessary for a truly general skill learning framework, which is a key building block needed to scale up to lifelong learning agents. The ASAP framework can also solve related new tasks simply by adapting where it applies its existing learned skills. We prove that ASAP converges to a local optimum under natural conditions. Finally, our experimental results, which include a RoboCup domain, demonstrate the ability of ASAP to learn where to reuse skills as well as solve multiple tasks with considerably less experience than solving each task from scratch.

Taxonomy

Topics: Reinforcement Learning in Robotics · Scheduling and Optimization Algorithms · Optimization and Search Problems