PCHC: Enabling Preference Conditioned Humanoid Control via Multi-Objective Reinforcement Learning

Huanyu Li; Dewei Wang; Xinmiao Wang; Xinzhe Liu; Peng Liu; Chenjia Bai; Xuelong Li

arXiv:2603.24047 · cs.RO · March 26, 2026

TL;DR

This paper introduces PCHC, a novel multi-objective reinforcement learning framework that enables humanoid robots to adaptively balance competing goals like speed and energy efficiency through a single preference-conditioned policy.

Contribution

The paper presents a new MORL framework with a preference-conditioned policy and a Beta distribution-based alignment mechanism, allowing diverse behaviors without multiple separate policies.
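To make the idea of a preference-conditioned policy concrete, here is a minimal sketch for two objectives (for example speed and energy). It rests on assumptions: the summary does not say how the Beta distribution is used, so the sketch simply assumes it samples a preference weight during training, and the names `sample_preference`, `scalarize`, and `condition_observation` are hypothetical rather than taken from the paper.

```python
import numpy as np

def sample_preference(rng, alpha=0.5, beta=0.5):
    """Draw a 2-objective preference (w, 1 - w) with w ~ Beta(alpha, beta).

    Assumption: the shape parameters are illustrative, not the paper's values.
    """
    w = rng.beta(alpha, beta)
    return np.array([w, 1.0 - w])

def scalarize(reward_vec, preference):
    """Weighted sum of the per-objective rewards under the given preference."""
    return float(np.dot(preference, reward_vec))

def condition_observation(obs, preference):
    """Append the preference to the observation so one policy can see it."""
    return np.concatenate([obs, preference])

rng = np.random.default_rng(0)
pref = sample_preference(rng)

obs = np.zeros(4)                      # placeholder observation
reward_vec = np.array([1.0, -0.2])     # placeholder (speed, energy) rewards

policy_input = condition_observation(obs, pref)
reward = scalarize(reward_vec, pref)
```

Because the preference is part of the policy input, changing it at run time changes the behavior without retraining, which is the property the contribution statement describes.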

Findings

01. Enables real-time adaptation of robot behavior based on preferences

02. Demonstrates effectiveness on humanoid tasks in simulation and in the real world

03. Provides a spectrum of behaviors from a single policy
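One way to examine a spectrum of behaviors is to evaluate the policy under several preferences and keep only the non-dominated outcomes. The sketch below is a generic Pareto filter, not the paper's evaluation procedure; `pareto_mask` is a hypothetical helper, and the numbers are placeholders, not results from the paper.

```python
import numpy as np

def pareto_mask(points):
    """Return a boolean mask of non-dominated rows (all objectives maximized)."""
    pts = np.asarray(points, dtype=float)
    mask = np.ones(pts.shape[0], dtype=bool)
    for i in range(pts.shape[0]):
        # Row j dominates row i if it is >= on every objective
        # and strictly > on at least one.
        dominates_i = np.all(pts >= pts[i], axis=1) & np.any(pts > pts[i], axis=1)
        if dominates_i.any():
            mask[i] = False
    return mask

# Placeholder (speed, negative energy) outcomes, one row per preference setting.
outcomes = np.array([
    [1.0, -5.0],
    [2.0, -8.0],
    [1.5, -9.0],   # slower and costlier than the row above, so dominated
    [0.5, -3.0],
])
mask = pareto_mask(outcomes)
front = outcomes[mask]
```

Energy is negated so that both columns are maximized, which keeps the dominance test uniform across objectives.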

Abstract

Humanoid robots often need to balance competing objectives, such as maximizing speed while minimizing energy consumption. While current reinforcement learning (RL) methods can master complex skills like fall recovery and perceptive locomotion, they are constrained by fixed weighting strategies that produce a single suboptimal policy, rather than providing a diverse set of solutions for sophisticated multi-objective control. In this paper, we propose a novel framework leveraging Multi-Objective Reinforcement Learning (MORL) to achieve Preference-Conditioned Humanoid Control (PCHC). Unlike conventional methods that require training a series of policies to approximate the Pareto front, our framework enables a single, preference-conditioned policy to exhibit a wide spectrum of diverse behaviors. To effectively integrate these requirements, we introduce a Beta distribution-based alignment…

Taxonomy

Topics: Robotic Locomotion and Control · Reinforcement Learning in Robotics · Social Robot Interaction and HRI