Matching options to tasks using Option-Indexed Hierarchical   Reinforcement Learning

Kushal Chauhan; Soumya Chatterjee; Akash Reddy; Balaraman Ravindran,; Pradeep Shenoy

arXiv:2206.05750·cs.LG·June 14, 2022

Matching options to tasks using Option-Indexed Hierarchical Reinforcement Learning

Kushal Chauhan, Soumya Chatterjee, Akash Reddy, Balaraman Ravindran,, Pradeep Shenoy

PDF

Open Access

TL;DR

This paper introduces a novel method called Option-Indexed Hierarchical Reinforcement Learning (OI-HRL) that enables zero-shot transfer of pretrained options to new tasks by learning an affinity function between options and environment items, improving efficiency and reuse.

Contribution

The paper proposes a new option indexing approach for hierarchical RL that allows effective reuse of pretrained options across tasks through an affinity function and meta-training, enabling zero-shot generalization.

Findings

01

OI-HRL achieves performance comparable to oracular baselines.

02

OI-HRL shows substantial improvements over baseline methods without option reuse.

03

The approach demonstrates effective zero-shot transfer in simulated environments.

Abstract

The options framework in Hierarchical Reinforcement Learning breaks down overall goals into a combination of options or simpler tasks and associated policies, allowing for abstraction in the action space. Ideally, these options can be reused across different higher-level goals; indeed, such reuse is necessary to realize the vision of a continual learning agent that can effectively leverage its prior experience. Previous approaches have only proposed limited forms of transfer of prelearned options to new task settings. We propose a novel option indexing approach to hierarchical learning (OI-HRL), where we learn an affinity function between options and the items present in the environment. This allows us to effectively reuse a large library of pretrained options, in zero-shot generalization at test time, by restricting goal-directed learning to only those options relevant to the task at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning

MethodsTest