CLARIFY: Contrastive Preference Reinforcement Learning for Untangling Ambiguous Queries

Ni Mu; Hao Hu; Xiao Hu; Yiqin Yang; Bo Xu; Qing-Shan Jia

arXiv:2506.00388 · cs.LG · June 11, 2025

TL;DR

CLARIFY applies contrastive learning to offline preference-based reinforcement learning: it learns a trajectory embedding that makes it easier to select clear queries, which helps most when human preferences are ambiguous or noisy.

Contribution

The paper proposes an offline preference-based reinforcement learning (PbRL) method that learns a trajectory embedding space in which clearly distinguished segments lie far apart. This makes it easier to select unambiguous queries and improves label efficiency.
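
As a rough illustration of how an embedding space could guide query selection, the sketch below ranks candidate segment pairs by the distance between their embeddings and keeps the farthest pairs. This is a minimal sketch under stated assumptions, not the paper's selection rule: the function names (`euclidean`, `select_queries`), the Euclidean metric, and the toy 2-D embeddings are all hypothetical.

```python
import math
from itertools import combinations


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def select_queries(embeddings, num_queries):
    """Return the index pairs whose embeddings are farthest apart.

    Assumption (not from the paper): a larger embedding distance means
    the two segments are easier for a human to tell apart.
    """
    pairs = combinations(range(len(embeddings)), 2)
    ranked = sorted(
        pairs,
        key=lambda p: euclidean(embeddings[p[0]], embeddings[p[1]]),
        reverse=True,
    )
    return ranked[:num_queries]


# Toy embeddings for four trajectory segments (hypothetical values).
embeddings = [[0.0, 0.0], [0.1, 0.0], [3.0, 4.0], [0.0, 1.0]]
queries = select_queries(embeddings, 2)
print(queries)  # [(0, 2), (1, 2)]
```

Segments 0 and 1 are nearly identical in this toy space, so that pair ranks last and would not be sent to the labeler.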

Findings

01. Outperforms baselines in non-ideal teacher scenarios

02. Effective with real human feedback

03. Learns meaningful trajectory embeddings

Abstract

Preference-based reinforcement learning (PbRL) bypasses explicit reward engineering by inferring reward functions from human preference comparisons, enabling better alignment with human intentions. However, humans often struggle to label a clear preference between similar segments, reducing label efficiency and limiting PbRL's real-world applicability. To address this, we propose an offline PbRL method: Contrastive LeArning for ResolvIng Ambiguous Feedback (CLARIFY), which learns a trajectory embedding space that incorporates preference information, ensuring clearly distinguished segments are spaced apart, thus facilitating the selection of more unambiguous queries. Extensive experiments demonstrate that CLARIFY outperforms baselines in both non-ideal teacher and real human feedback settings. Our approach not only selects more distinguished queries but also learns meaningful trajectory…
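
To make "inferring reward functions from human preference comparisons" concrete, the sketch below uses the Bradley-Terry model, a common choice in PbRL work; the text above does not confirm that CLARIFY uses exactly this form. The function names and the toy per-step rewards are hypothetical.

```python
import math


def preference_probability(rewards_a, rewards_b):
    """P(segment A is preferred over B) under a Bradley-Terry model.

    Each argument is the list of predicted per-step rewards for one
    segment; segments are compared by their summed reward.
    """
    return 1.0 / (1.0 + math.exp(sum(rewards_b) - sum(rewards_a)))


def preference_loss(rewards_a, rewards_b, label):
    """Cross-entropy between the model's probability and a human label.

    label is 1.0 if the human preferred A and 0.0 if they preferred B.
    """
    p = preference_probability(rewards_a, rewards_b)
    eps = 1e-12  # guards against log(0)
    return -(label * math.log(p + eps) + (1.0 - label) * math.log(1.0 - p + eps))


# Toy predicted rewards for two segments (hypothetical values).
seg_a = [2.0, 1.0]
seg_b = [0.5, 0.5]
loss_if_a_preferred = preference_loss(seg_a, seg_b, 1.0)
loss_if_b_preferred = preference_loss(seg_a, seg_b, 0.0)
```

Minimizing this loss over labeled pairs pushes the reward model to assign a higher summed reward to the preferred segment. When two segments have nearly equal summed reward, the probability sits near 0.5, which is the regime where a human label is least reliable.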

Code & Models

Repositories

moonoutcloudback/clarify_pbrl (PyTorch, official)

Taxonomy

Topics: Reinforcement Learning in Robotics · Advanced Graph Neural Networks · Multimodal Machine Learning Applications

Methods: Contrastive Learning