Inferring Rewards from Language in Context

Jessy Lin; Daniel Fried; Dan Klein; Anca Dragan

arXiv:2204.02515 · cs.CL · April 7, 2022 · 1 citation

TL;DR

This paper introduces a model that pragmatically infers a user's reward function from language, so that it can predict desirable actions in new contexts rather than only carrying out the literal instruction.

Contribution

The paper presents an approach that infers rewards directly from language by reasoning pragmatically about the speaker. It is compared against a two-stage pipeline that first maps language to actions (instruction following) and then maps actions to rewards (inverse reinforcement learning).

Findings

01. More accurate reward inference from language in unseen environments

02. Improved prediction of optimal actions in unseen environments

03. Outperforms past work that chains instruction following with inverse reinforcement learning, evaluated on a new interactive flight-booking task

Abstract

In classic instruction following, language like "I'd like the JetBlue flight" maps to actions (e.g., selecting that flight). However, language also conveys information about a user's underlying reward function (e.g., a general preference for JetBlue), which can allow a model to carry out desirable actions in new contexts. We present a model that infers rewards from language pragmatically: reasoning about how speakers choose utterances not only to elicit desired actions, but also to reveal information about their preferences. On a new interactive flight-booking task with natural language, our model more accurately infers rewards and predicts optimal actions in unseen environments, in comparison to past work that first maps language to actions (instruction following) and then maps actions to rewards (inverse reinforcement learning).
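The inference described in the abstract can be illustrated with a toy example. The sketch below is not the authors' model or code: the flight features, the utterance set, the candidate reward weights, and the function names are all hypothetical, and the speaker here only chooses utterances to elicit good actions (the paper's speaker also reasons about revealing preferences). It shows the general pattern of inverting a speaker model to get a posterior over rewards, then using that posterior to act in a new context.

```python
import math
from itertools import product

# Hypothetical toy setup: each flight is (is_jetblue, is_cheap).
CONTEXT = [(1, 0), (0, 1), (0, 0)]
UTTERANCES = {"jetblue": 0, "cheap": 1}  # utterance -> feature index it names
# Candidate reward weights over the two features (assumed discrete grid).
REWARDS = list(product([-1, 0, 1], repeat=2))

def reward(w, flight):
    """Linear reward: dot product of weights and flight features."""
    return sum(wi * fi for wi, fi in zip(w, flight))

def literal_listener(utterance, context):
    """Uniform distribution over flights that have the named feature."""
    idx = UTTERANCES[utterance]
    matches = [f for f in context if f[idx] == 1]
    pool = matches or context  # fall back to all flights if none match
    return {f: 1 / len(pool) for f in pool}

def speaker(w, context, beta=3.0):
    """Softmax over utterances by the expected reward of the listener's action."""
    utils = {}
    for u in UTTERANCES:
        dist = literal_listener(u, context)
        utils[u] = sum(p * reward(w, f) for f, p in dist.items())
    z = sum(math.exp(beta * v) for v in utils.values())
    return {u: math.exp(beta * v) / z for u, v in utils.items()}

def infer_reward(utterance, context):
    """Posterior over weights: P(w | u) is proportional to S(u | w) under a uniform prior."""
    scores = {w: speaker(w, context)[utterance] for w in REWARDS}
    z = sum(scores.values())
    return {w: s / z for w, s in scores.items()}

def predict_action(posterior, context):
    """Pick the flight with the highest expected reward under the posterior."""
    def expected(flight):
        return sum(p * reward(w, flight) for w, p in posterior.items())
    return max(context, key=expected)

posterior = infer_reward("jetblue", CONTEXT)
best_w = max(posterior, key=posterior.get)

# A new context the speaker never described.
NEW_CONTEXT = [(0, 1), (1, 0)]
predicted = predict_action(posterior, NEW_CONTEXT)
print(best_w, predicted)
```

In this toy setup, hearing "jetblue" when a cheap alternative was available shifts the posterior toward weights that favor JetBlue over price, and that inferred preference carries over to the new context.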

Code & Models

Repositories

jlin816/rewards-from-language (PyTorch, official)

Taxonomy

Topics: Natural Language Processing Techniques · Speech and dialogue systems · Topic Modeling