Multi-Type Preference Learning: Empowering Preference-Based   Reinforcement Learning with Equal Preferences

Ziang Liu; Junjie Xu; Xingjiao Wu; Jing Yang; Liang He

arXiv:2409.07268·cs.LG·October 16, 2024

Multi-Type Preference Learning: Empowering Preference-Based Reinforcement Learning with Equal Preferences

Ziang Liu, Junjie Xu, Xingjiao Wu, Jing Yang, Liang He

PDF

Open Access 1 Repo

TL;DR

This paper introduces Multi-Type Preference Learning (MTPL), a novel approach in preference-based reinforcement learning that incorporates equal preferences, leading to improved understanding and efficiency in learning from human feedback.

Contribution

The paper proposes the Equal Preference Learning Task and MTPL, enabling simultaneous learning from equal and explicit preferences, which enhances feedback utilization in PBRL.

Findings

01

MTPL improves learning efficiency across multiple tasks.

02

Incorporating equal preferences enhances agent understanding.

03

Experimental results outperform existing baselines.

Abstract

Preference-Based reinforcement learning (PBRL) learns directly from the preferences of human teachers regarding agent behaviors without needing meticulously designed reward functions. However, existing PBRL methods often learn primarily from explicit preferences, neglecting the possibility that teachers may choose equal preferences. This neglect may hinder the understanding of the agent regarding the task perspective of the teacher, leading to the loss of important information. To address this issue, we introduce the Equal Preference Learning Task, which optimizes the neural network by promoting similar reward predictions when the behaviors of two agents are labeled as equal preferences. Building on this task, we propose a novel PBRL method, Multi-Type Preference Learning (MTPL), which allows simultaneous learning from equal preferences while leveraging existing methods for learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

feicuilengmmbb/paper_mtpl
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Criteria Decision Making