A Compression-Inspired Framework for Macro Discovery

Francisco M. Garcia; Bruno C. da Silva; and Philip S. Thomas

arXiv:1711.09048·cs.AI·February 26, 2019·2 cites

A Compression-Inspired Framework for Macro Discovery

Francisco M. Garcia, Bruno C. da Silva, and Philip S. Thomas

PDF

Open Access

TL;DR

This paper introduces a framework that uses trajectory compression to discover macros, enabling reinforcement learning agents to leverage past experience for faster adaptation to new, related tasks.

Contribution

It presents a novel three-step macro discovery framework based on compression, enhancing transfer learning in reinforcement learning by identifying and utilizing recurrent trajectory patterns.

Findings

01

Macros improve learning speed in new tasks

02

The framework effectively identifies diverse useful macros

03

Enhanced policies outperform primitive-only approaches

Abstract

In this paper we consider the problem of how a reinforcement learning agent tasked with solving a set of related Markov decision processes can use knowledge acquired early in its lifetime to improve its ability to more rapidly solve novel, but related, tasks. One way of exploiting this experience is by identifying recurrent patterns in trajectories obtained from well-performing policies. We propose a three-step framework in which an agent 1) generates a set of candidate open-loop macros by compressing trajectories drawn from near-optimal policies; 2) evaluates the value of each macro; and 3) selects a maximally diverse subset of macros that spans the space of policies typically required for solving the set of related tasks. Our experiments show that extending the original primitive action-set of the agent with the identified macros allows it to more rapidly learn an optimal policy in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Machine Learning and Data Classification