Context-Specific Representation Abstraction for Deep Option Learning

Marwa Abdulhai; Dong-Ki Kim; Matthew Riemer; Miao Liu; Gerald Tesauro,; Jonathan P. How

arXiv:2109.09876·cs.LG·April 26, 2022

Context-Specific Representation Abstraction for Deep Option Learning

Marwa Abdulhai, Dong-Ki Kim, Matthew Riemer, Miao Liu, Gerald Tesauro,, Jonathan P. How

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces CRADOL, a framework that enhances deep option learning by learning factored belief state representations, reducing policy search space, and improving sample efficiency in complex partially observable environments.

Contribution

CRADOL is a novel framework that combines temporal and context-specific representation abstraction to improve hierarchical reinforcement learning efficiency.

Findings

01

CRADOL significantly outperforms baselines in sample efficiency.

02

It effectively reduces the policy search space.

03

Demonstrates robustness in partially observable environments.

Abstract

Hierarchical reinforcement learning has focused on discovering temporally extended actions, such as options, that can provide benefits in problems requiring extensive exploration. One promising approach that learns these options end-to-end is the option-critic (OC) framework. We examine and show in this paper that OC does not decompose a problem into simpler sub-problems, but instead increases the size of the search over policy space with each option considering the entire state space during learning. This issue can result in practical limitations of this method, including sample inefficient learning. To address this problem, we introduce Context-Specific Representation Abstraction for Deep Option Learning (CRADOL), a new framework that considers both temporal abstraction and context-specific representation abstraction to effectively reduce the size of the search over policy space.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

maximilianigl/rl-msol
pytorchOfficial

Videos

Context-Specific Representation Abstraction for Deep Option Learning· underline

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Explainable Artificial Intelligence (XAI)

MethodsTest