Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning
Tianmin Shu, Caiming Xiong, Richard Socher

TL;DR
This paper introduces a hierarchical, interpretable multi-task reinforcement learning framework that enables agents to learn, reuse, and acquire new skills efficiently, with human-readable descriptions guiding skill selection.
Contribution
It proposes a novel hierarchical policy framework with temporal grammar for interpretable multi-task RL, allowing continual skill acquisition and reuse based on human language descriptions.
Findings
Agents effectively reuse skills in Minecraft tasks.
The framework improves learning efficiency for complex multi-task environments.
Agents provide human-interpretable descriptions of their decisions.
Abstract
Learning policies for complex tasks that require multiple different skills is a major challenge in reinforcement learning (RL). It is also a requirement for its deployment in real-world scenarios. This paper proposes a novel framework for efficient multi-task reinforcement learning. Our framework trains agents to employ hierarchical policies that decide when to use a previously learned policy and when to learn a new skill. This enables agents to continually acquire new skills during different stages of training. Each learned task corresponds to a human language description. Because agents can only access previously learned skills through these descriptions, the agent can always provide a human-interpretable description of its choices. In order to help the agent learn the complex temporal dependencies necessary for the hierarchical policy, we provide it with a stochastic temporal grammar…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Fault Detection and Control Systems
