Pareto Set Learning for Multi-Objective Reinforcement Learning

Erlong Liu; Yu-Chang Wu; Xiaobin Huang; Chengrui Gao; Ren-Jian Wang; Ke Xue; Chao Qian

arXiv:2501.06773·cs.LG·January 15, 2025

TL;DR

This paper introduces PSL-MORL, a framework in which a hypernetwork generates a distinct policy for each preference over the objectives in multi-objective reinforcement learning. In the reported experiments it covers the Pareto front densely and achieves higher hypervolume than existing MORL methods.

Contribution

The paper presents a decomposition-based hypernetwork framework for MORL that produces personalized policies for different preferences, enhancing Pareto front coverage and efficiency.

Findings

01. Achieves dense Pareto front coverage in experiments.

02. Outperforms state-of-the-art MORL methods in hypervolume.

03. Provides theoretical guarantees on model capacity and policy optimality.
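
Hypervolume, the metric named in the second finding, measures the region dominated by a set of solutions relative to a reference point. The sketch below illustrates it for two objectives under maximization. It is a generic textbook computation, not the paper's evaluation code; the function names `pareto_front_2d` and `hypervolume_2d`, the sample points, and the reference point are all assumptions made for illustration.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def pareto_front_2d(points: List[Point]) -> List[Point]:
    """Return the non-dominated points (maximization), sorted by objective 1 descending."""
    front: List[Point] = []
    best_f2 = float("-inf")
    # Visit points from largest to smallest first objective. A point is
    # non-dominated only if its second objective beats everything seen so far.
    for f1, f2 in sorted(points, key=lambda p: (-p[0], -p[1])):
        if f2 > best_f2:
            front.append((f1, f2))
            best_f2 = f2
    return front


def hypervolume_2d(points: List[Point], ref: Point) -> float:
    """Area dominated by `points` and bounded below by the reference point `ref`."""
    # Points that do not strictly improve on the reference contribute nothing.
    candidates = [p for p in points if p[0] > ref[0] and p[1] > ref[1]]
    volume = 0.0
    prev_f2 = ref[1]
    # Along the front, objective 1 decreases while objective 2 increases,
    # so each point adds one horizontal slab of new area.
    for f1, f2 in pareto_front_2d(candidates):
        volume += (f1 - ref[0]) * (f2 - prev_f2)
        prev_f2 = f2
    return volume


# Hypothetical returns of four policies on two objectives; (1, 1) is dominated.
returns = [(3.0, 1.0), (2.0, 2.0), (1.0, 3.0), (1.0, 1.0)]
hv = hypervolume_2d(returns, ref=(0.0, 0.0))
```

A larger hypervolume means the set of policies pushes further from the reference point, covers the trade-off curve more completely, or both, which is why it is commonly used to compare MORL methods.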

Abstract

Multi-objective decision-making problems have emerged in numerous real-world scenarios, such as video games, navigation, and robotics. Considering the clear advantages of Reinforcement Learning (RL) in optimizing decision-making processes, researchers have developed Multi-Objective RL (MORL) methods for solving multi-objective decision problems. However, previous methods either cannot obtain the entire Pareto front, or employ only a single policy network for all the preferences over multiple objectives, which may not produce personalized solutions for each preference. To address these limitations, we propose a novel decomposition-based framework for MORL, Pareto Set Learning for MORL (PSL-MORL), that harnesses the generation capability of a hypernetwork to produce the parameters of the policy network for each decomposition weight, generating relatively distinct…
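
The core idea described in the abstract, a hypernetwork that maps a decomposition weight to the parameters of a policy network, can be illustrated with a minimal NumPy sketch. Everything here is an assumption made for illustration: the class name `HyperPolicy`, the layer sizes, the linear-tanh policy, and the weighted-sum scalarization in `scalarize`. The excerpt above does not specify the paper's architecture or training procedure, and no training is shown.

```python
import numpy as np


class HyperPolicy:
    """Toy hypernetwork: preference vector -> parameters of a linear-tanh policy."""

    def __init__(self, n_objectives, obs_dim, act_dim, hidden=16, seed=0):
        rng = np.random.default_rng(seed)
        self.obs_dim, self.act_dim = obs_dim, act_dim
        n_policy_params = obs_dim * act_dim + act_dim  # policy weight matrix + bias
        # Hypernetwork weights (randomly initialized; untrained in this sketch).
        self.W1 = rng.normal(0.0, 0.5, size=(hidden, n_objectives))
        self.b1 = np.zeros(hidden)
        self.W2 = rng.normal(0.0, 0.5, size=(n_policy_params, hidden))
        self.b2 = np.zeros(n_policy_params)

    def policy_params(self, preference):
        """Generate the policy's weight matrix and bias for one preference."""
        w = np.asarray(preference, dtype=float)
        h = np.tanh(self.W1 @ w + self.b1)
        flat = self.W2 @ h + self.b2
        split = self.obs_dim * self.act_dim
        weight = flat[:split].reshape(self.act_dim, self.obs_dim)
        bias = flat[split:]
        return weight, bias

    def act(self, obs, preference):
        """Run the generated policy on an observation; actions lie in [-1, 1]."""
        weight, bias = self.policy_params(preference)
        return np.tanh(weight @ np.asarray(obs, dtype=float) + bias)


def scalarize(reward_vector, preference):
    """Weighted-sum decomposition (an assumed, common choice of scalarization)."""
    return float(np.dot(preference, reward_vector))


# Two hypothetical preferences over two objectives yield two different policies.
hyper = HyperPolicy(n_objectives=2, obs_dim=3, act_dim=2)
obs = [0.1, -0.2, 0.3]
action_a = hyper.act(obs, [0.9, 0.1])
action_b = hyper.act(obs, [0.1, 0.9])
```

Because the policy parameters are a function of the preference, a single trained hypernetwork can in principle be queried for any weight vector, in contrast to conditioning one fixed policy network on the preference as an extra input.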

Taxonomy

Methods: HyperNetwork · Sparse Evolutionary Training