Learning Human Preferences Over Robot Behavior as Soft Planning   Constraints

Austin Narcomey (1); Nathan Tsoi (1); Ruta Desai (2); Marynel; V\'azquez (1) ((1) Yale University; (2) Meta AI)

arXiv:2403.19795·cs.RO·April 1, 2024·1 cites

Learning Human Preferences Over Robot Behavior as Soft Planning Constraints

Austin Narcomey (1), Nathan Tsoi (1), Ruta Desai (2), Marynel, V\'azquez (1) ((1) Yale University, (2) Meta AI)

PDF

Open Access

TL;DR

This paper introduces a planning-based approach to learning human preferences in robot behavior, distinguishing between required and desired actions, and demonstrates its effectiveness in simulated rearrangement tasks.

Contribution

It proposes a novel formulation encoding preferences as soft planning constraints and a data-driven method for inferring these preferences through user queries.

Findings

01

Effective inference of preferences under noisy user choices

02

Successful application in simulated rearrangement tasks

03

Potential for adaptable planning-based robot behavior

Abstract

Preference learning has long been studied in Human-Robot Interaction (HRI) in order to adapt robot behavior to specific user needs and desires. Typically, human preferences are modeled as a scalar function; however, such a formulation confounds critical considerations on how the robot should behave for a given task, with desired -- but not required -- robot behavior. In this work, we distinguish between such required and desired robot behavior by leveraging a planning framework. Specifically, we propose a novel problem formulation for preference learning in HRI where various types of human preferences are encoded as soft planning constraints. Then, we explore a data-driven method to enable a robot to infer preferences by querying users, which we instantiate in rearrangement tasks in the Habitat 2.0 simulator. We show that the proposed approach is promising at inferring three types of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · AI-based Problem Solving and Planning