Iterative Policy-Space Expansion in Reinforcement Learning

Jan Malte Lichtenberg; \"Ozg\"ur \c{S}im\c{s}ek

arXiv:1912.02532·cs.LG·December 6, 2019

Iterative Policy-Space Expansion in Reinforcement Learning

Jan Malte Lichtenberg, \"Ozg\"ur \c{S}im\c{s}ek

PDF

Open Access

TL;DR

This paper introduces an iterative policy-space expansion method in reinforcement learning, where an agent progressively refines its policy by starting with broad categories and narrowing down, leading to faster learning in complex tasks like Tetris.

Contribution

The paper proposes a novel reinforcement learning algorithm that gradually refines the policy space without external curricula, inspired by human problem-solving strategies.

Findings

01

Faster learning rate in Tetris compared to existing algorithms

02

Effective feature categorization before policy refinement

03

Demonstrates benefits of iterative policy-space expansion

Abstract

Humans and animals solve a difficult problem much more easily when they are presented with a sequence of problems that starts simple and slowly increases in difficulty. We explore this idea in the context of reinforcement learning. Rather than providing the agent with an externally provided curriculum of progressively more difficult tasks, the agent solves a single task utilizing a decreasingly constrained policy space. The algorithm we propose first learns to categorize features into positive and negative before gradually learning a more refined policy. Experimental results in Tetris demonstrate superior learning rate of our approach when compared to existing algorithms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Software Engineering Research