Preference-based Multi-Objective Reinforcement Learning

Ni Mu; Yao Luan; Qing-Shan Jia

arXiv:2507.14066·cs.LG·July 21, 2025

Preference-based Multi-Objective Reinforcement Learning

Ni Mu, Yao Luan, Qing-Shan Jia

PDF

TL;DR

This paper introduces preference-based MORL, integrating user preferences into the framework to efficiently find Pareto optimal policies without complex reward design, demonstrated through extensive experiments.

Contribution

It formalizes preference integration into MORL, providing theoretical guarantees and a practical method that outperforms traditional approaches in complex tasks.

Findings

01

Method performs competitively on benchmark tasks.

02

Outperforms oracle method using ground truth rewards.

03

Effective in real-world complex systems.

Abstract

Multi-objective reinforcement learning (MORL) is a structured approach for optimizing tasks with multiple objectives. However, it often relies on pre-defined reward functions, which can be hard to design for balancing conflicting goals and may lead to oversimplification. Preferences can serve as more flexible and intuitive decision-making guidance, eliminating the need for complicated reward design. This paper introduces preference-based MORL (Pb-MORL), which formalizes the integration of preferences into the MORL framework. We theoretically prove that preferences can derive policies across the entire Pareto frontier. To guide policy optimization using preferences, our method constructs a multi-objective reward model that aligns with the given preferences. We further provide theoretical proof to show that optimizing this reward model is equivalent to training the Pareto optimal policy.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.