Contrastive Learning from Exploratory Actions: Leveraging Natural   Interactions for Preference Elicitation

Nathaniel Dennler; Stefanos Nikolaidis; Maja Matari\'c

arXiv:2501.01367·cs.RO·January 3, 2025

Contrastive Learning from Exploratory Actions: Leveraging Natural Interactions for Preference Elicitation

Nathaniel Dennler, Stefanos Nikolaidis, Maja Matari\'c

PDF

Open Access

TL;DR

This paper introduces CLEA, a contrastive learning approach that leverages users' exploratory actions to learn meaningful robot behavior features, improving preference elicitation efficiency and quality.

Contribution

The paper presents a novel contrastive learning method that uses exploratory user actions to learn semantically meaningful robot behavior features, reducing labeling effort.

Findings

01

CLEA features outperform self-supervised features in preference elicitation.

02

CLEA improves metrics: completeness, simplicity, minimality, explainability.

03

User exploratory actions provide valuable data for learning behavior features.

Abstract

People have a variety of preferences for how robots behave. To understand and reason about these preferences, robots aim to learn a reward function that describes how aligned robot behaviors are with a user's preferences. Good representations of a robot's behavior can significantly reduce the time and effort required for a user to teach the robot their preferences. Specifying these representations -- what "features" of the robot's behavior matter to users -- remains a difficult problem; Features learned from raw data lack semantic meaning and features learned from user data require users to engage in tedious labeling processes. Our key insight is that users tasked with customizing a robot are intrinsically motivated to produce labels through exploratory search; they explore behaviors that they find interesting and ignore behaviors that are irrelevant. To harness this novel data source…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Text Analysis Techniques

MethodsSparse Evolutionary Training · Contrastive Learning