From Language to Goals: Inverse Reinforcement Learning for Vision-Based   Instruction Following

Justin Fu; Anoop Korattikara; Sergey Levine; Sergio Guadarrama

arXiv:1902.07742·cs.LG·February 22, 2019·67 cites

From Language to Goals: Inverse Reinforcement Learning for Vision-Based Instruction Following

Justin Fu, Anoop Korattikara, Sergey Levine, Sergio Guadarrama

PDF

Open Access

TL;DR

This paper introduces language-conditioned reward learning (LC-RL), enabling machines to understand natural language commands as reward functions, which improves transferability to new tasks and environments in vision-based instruction following.

Contribution

The paper proposes LC-RL, a novel method that grounds language commands as reward functions using deep neural networks, enhancing transferability over language-conditioned policies.

Findings

01

LC-RL learns transferable reward functions for new tasks.

02

Language-conditioned rewards outperform policies in transfer scenarios.

03

Model demonstrates effectiveness in high-dimensional visual environments.

Abstract

Reinforcement learning is a promising framework for solving control problems, but its use in practical situations is hampered by the fact that reward functions are often difficult to engineer. Specifying goals and tasks for autonomous machines, such as robots, is a significant challenge: conventionally, reward functions and goal states have been used to communicate objectives. But people can communicate objectives to each other simply by describing or demonstrating them. How can we build learning algorithms that will allow us to tell machines what we want them to do? In this work, we investigate the problem of grounding language commands as reward functions using inverse reinforcement learning, and argue that language-conditioned rewards are more transferable than language-conditioned policies to new environments. We propose language-conditioned reward learning (LC-RL), which grounds…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications