Matching options to tasks using Option-Indexed Hierarchical Reinforcement Learning
Kushal Chauhan, Soumya Chatterjee, Akash Reddy, Balaraman Ravindran,, Pradeep Shenoy

TL;DR
This paper introduces a novel method called Option-Indexed Hierarchical Reinforcement Learning (OI-HRL) that enables zero-shot transfer of pretrained options to new tasks by learning an affinity function between options and environment items, improving efficiency and reuse.
Contribution
The paper proposes a new option indexing approach for hierarchical RL that allows effective reuse of pretrained options across tasks through an affinity function and meta-training, enabling zero-shot generalization.
Findings
OI-HRL achieves performance comparable to oracular baselines.
OI-HRL shows substantial improvements over baseline methods without option reuse.
The approach demonstrates effective zero-shot transfer in simulated environments.
Abstract
The options framework in Hierarchical Reinforcement Learning breaks down overall goals into a combination of options or simpler tasks and associated policies, allowing for abstraction in the action space. Ideally, these options can be reused across different higher-level goals; indeed, such reuse is necessary to realize the vision of a continual learning agent that can effectively leverage its prior experience. Previous approaches have only proposed limited forms of transfer of prelearned options to new task settings. We propose a novel option indexing approach to hierarchical learning (OI-HRL), where we learn an affinity function between options and the items present in the environment. This allows us to effectively reuse a large library of pretrained options, in zero-shot generalization at test time, by restricting goal-directed learning to only those options relevant to the task at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning
MethodsTest
