Ask Your Humans: Using Human Instructions to Improve Generalization in   Reinforcement Learning

Valerie Chen; Abhinav Gupta; Kenneth Marino

arXiv:2011.00517·cs.LG·September 28, 2021·6 cites

Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning

Valerie Chen, Abhinav Gupta, Kenneth Marino

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a method that uses human-provided natural language instructions and action demonstrations to improve generalization, interpretability, and sample efficiency in multi-task reinforcement learning within a crafting grid world.

Contribution

It presents a novel framework combining language generation and low-level policies, enabling zero-shot generalization and interpretability in complex multi-task RL environments.

Findings

01

Human demonstrations improve performance on complex tasks.

02

Language conditioning enables zero-shot generalization to unseen tasks.

03

The model produces interpretable high-level action descriptions.

Abstract

Complex, multi-task problems have proven to be difficult to solve efficiently in a sparse-reward reinforcement learning setting. In order to be sample efficient, multi-task learning requires reuse and sharing of low-level policies. To facilitate the automatic decomposition of hierarchical tasks, we propose the use of step-by-step human demonstrations in the form of natural language instructions and action trajectories. We introduce a dataset of such demonstrations in a crafting-based grid world. Our model consists of a high-level language generator and low-level policy, conditioned on language. We find that human demonstrations help solve the most complex tasks. We also find that incorporating natural language allows the model to generalize to unseen tasks in a zero-shot setting and to learn quickly from a few demonstrations. Generalization is not only reflected in the actions of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

valeriechen/ask-your-humans
pytorchOfficial

Videos

Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification