Hierarchical and Interpretable Skill Acquisition in Multi-task   Reinforcement Learning

Tianmin Shu; Caiming Xiong; Richard Socher

arXiv:1712.07294·cs.AI·December 21, 2017·58 cites

Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning

Tianmin Shu, Caiming Xiong, Richard Socher

PDF

Open Access

TL;DR

This paper introduces a hierarchical, interpretable multi-task reinforcement learning framework that enables agents to learn, reuse, and acquire new skills efficiently, with human-readable descriptions guiding skill selection.

Contribution

It proposes a novel hierarchical policy framework with temporal grammar for interpretable multi-task RL, allowing continual skill acquisition and reuse based on human language descriptions.

Findings

01

Agents effectively reuse skills in Minecraft tasks.

02

The framework improves learning efficiency for complex multi-task environments.

03

Agents provide human-interpretable descriptions of their decisions.

Abstract

Learning policies for complex tasks that require multiple different skills is a major challenge in reinforcement learning (RL). It is also a requirement for its deployment in real-world scenarios. This paper proposes a novel framework for efficient multi-task reinforcement learning. Our framework trains agents to employ hierarchical policies that decide when to use a previously learned policy and when to learn a new skill. This enables agents to continually acquire new skills during different stages of training. Each learned task corresponds to a human language description. Because agents can only access previously learned skills through these descriptions, the agent can always provide a human-interpretable description of its choices. In order to help the agent learn the complex temporal dependencies necessary for the hierarchical policy, we provide it with a stochastic temporal grammar…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Fault Detection and Control Systems