Inverse Reinforcement Learning with Natural Language Goals

Li Zhou; Kevin Small

arXiv:2008.06924·cs.LG·December 17, 2020·1 cites

Inverse Reinforcement Learning with Natural Language Goals

Li Zhou, Kevin Small

PDF

Open Access 1 Video

TL;DR

This paper introduces an adversarial inverse reinforcement learning method that uses natural language goals to train policies, improving generalization and performance in vision-based instruction following tasks.

Contribution

It proposes a novel adversarial IRL algorithm with a variational goal generator for better generalization of language-conditioned policies.

Findings

01

Outperforms baselines on Room-2-Room dataset

02

Enhances generalization to new goals and environments

03

Demonstrates effective use of natural language for goal specification

Abstract

Humans generally use natural language to communicate task requirements to each other. Ideally, natural language should also be usable for communicating goals to autonomous machines (e.g., robots) to minimize friction in task specification. However, understanding and mapping natural language goals to sequences of states and actions is challenging. Specifically, existing work along these lines has encountered difficulty in generalizing learned policies to new natural language goals and environments. In this paper, we propose a novel adversarial inverse reinforcement learning algorithm to learn a language-conditioned policy and reward function. To improve generalization of the learned policy and reward function, we use a variational goal generator to relabel trajectories and sample diverse goals during training. Our algorithm outperforms multiple baselines by a large margin on a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Inverse Reinforcement Learning with Natural Language Goals· underline

Taxonomy

TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Robot Manipulation and Learning