Planning with Abstract Learned Models While Learning Transferable Subtasks
John Winder, Stephanie Milani, Matthew Landen, Erebus Oh, Shane Parr,, Shawn Squire, Marie desJardins, Cynthia Matuszek

TL;DR
This paper presents PALM, an algorithm for hierarchical reinforcement learning that learns abstract, modular models for planning, enabling efficient learning and transferability across related tasks.
Contribution
Introduction of PALM, a framework that learns symbolic, abstract models for hierarchical planning and transfer in reinforcement learning.
Findings
PALM effectively integrates planning and execution.
Models learned are modular and transferable.
Experiments show rapid learning of hierarchical models.
Abstract
We introduce an algorithm for model-based hierarchical reinforcement learning to acquire self-contained transition and reward models suitable for probabilistic planning at multiple levels of abstraction. We call this framework Planning with Abstract Learned Models (PALM). By representing subtasks symbolically using a new formal structure, the lifted abstract Markov decision process (L-AMDP), PALM learns models that are independent and modular. Through our experiments, we show how PALM integrates planning and execution, facilitating a rapid and efficient learning of abstract, hierarchical models. We also demonstrate the increased potential for learned models to be transferred to new and related tasks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
