Optimizing Algorithms From Pairwise User Preferences

Leonid Keselman; Katherine Shih; Martial Hebert; Aaron Steinfeld

arXiv:2308.04571·cs.RO·August 10, 2023

Optimizing Algorithms From Pairwise User Preferences

Leonid Keselman, Katherine Shih, Martial Hebert, Aaron Steinfeld

PDF

Open Access

TL;DR

This paper introduces SortCMA, a method for optimizing algorithm parameters based on pairwise user preferences, especially useful when explicit metrics are unavailable, demonstrated on robot sensor tuning and social navigation.

Contribution

The paper presents SortCMA, a novel approach that optimizes parameters using pairwise preferences without modeling explicit rewards, suitable for high-dimensional problems.

Findings

01

Successfully optimized a depth sensor without ground truth

02

Improved robot social navigation aligning with user preferences

03

User study validated effectiveness of the approach

Abstract

Typical black-box optimization approaches in robotics focus on learning from metric scores. However, that is not always possible, as not all developers have ground truth available. Learning appropriate robot behavior in human-centric contexts often requires querying users, who typically cannot provide precise metric scores. Existing approaches leverage human feedback in an attempt to model an implicit reward function; however, this reward may be difficult or impossible to effectively capture. In this work, we introduce SortCMA to optimize algorithm parameter configurations in high dimensions based on pairwise user preferences. SortCMA efficiently and robustly leverages user input to find parameter sets without directly modeling a reward. We apply this method to tuning a commercial depth sensor without ground truth, and to robot social navigation, which involves highly complex…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Modular Robots and Swarm Intelligence · Robotics and Sensor-Based Localization

MethodsFocus