Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning
Valerie Chen, Abhinav Gupta, Kenneth Marino

TL;DR
This paper introduces a method that uses human-provided natural language instructions and action demonstrations to improve generalization, interpretability, and sample efficiency in multi-task reinforcement learning within a crafting grid world.
Contribution
It presents a novel framework combining language generation and low-level policies, enabling zero-shot generalization and interpretability in complex multi-task RL environments.
Findings
Human demonstrations improve performance on complex tasks.
Language conditioning enables zero-shot generalization to unseen tasks.
The model produces interpretable high-level action descriptions.
Abstract
Complex, multi-task problems have proven to be difficult to solve efficiently in a sparse-reward reinforcement learning setting. In order to be sample efficient, multi-task learning requires reuse and sharing of low-level policies. To facilitate the automatic decomposition of hierarchical tasks, we propose the use of step-by-step human demonstrations in the form of natural language instructions and action trajectories. We introduce a dataset of such demonstrations in a crafting-based grid world. Our model consists of a high-level language generator and low-level policy, conditioned on language. We find that human demonstrations help solve the most complex tasks. We also find that incorporating natural language allows the model to generalize to unseen tasks in a zero-shot setting and to learn quickly from a few demonstrations. Generalization is not only reflected in the actions of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification
