Learning Pareto Set for Multi-Objective Continuous Robot Control

Tianye Shu; Ke Shang; Cheng Gong; Yang Nan; Hisao; Ishibuchi

arXiv:2406.18924·cs.AI·June 28, 2024

Learning Pareto Set for Multi-Objective Continuous Robot Control

Tianye Shu, Ke Shang, Cheng Gong, Yang Nan, Hisao, Ishibuchi

PDF

Open Access 1 Repo

TL;DR

This paper introduces a resource-efficient MORL algorithm that learns a continuous Pareto set representation using a hypernet, enabling quick generation of policies for various preferences in complex robot control tasks.

Contribution

It proposes a novel hypernet-based MORL method that efficiently approximates the Pareto set in high-dimensional spaces, reducing resource consumption compared to existing algorithms.

Findings

01

Achieves best overall performance with fewer training parameters.

02

The Pareto set forms a curved surface in parameter space, offering new insights.

03

Outperforms two state-of-the-art MORL algorithms on robot control problems.

Abstract

For a control problem with multiple conflicting objectives, there exists a set of Pareto-optimal policies called the Pareto set instead of a single optimal policy. When a multi-objective control problem is continuous and complex, traditional multi-objective reinforcement learning (MORL) algorithms search for many Pareto-optimal deep policies to approximate the Pareto set, which is quite resource-consuming. In this paper, we propose a simple and resource-efficient MORL algorithm that learns a continuous representation of the Pareto set in a high-dimensional policy parameter space using a single hypernet. The learned hypernet can directly generate various well-trained policy networks for different user preferences. We compare our method with two state-of-the-art MORL algorithms on seven multi-objective continuous robot control problems. Experimental results show that our method achieves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hisaolabsustc/hyper-morl
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Advanced Control Systems Optimization · Advanced Multi-Objective Optimization Algorithms

MethodsSparse Evolutionary Training