AMOR: Adaptive Character Control through Multi-Objective Reinforcement Learning

Lucas N. Alegre; Agon Serifi; Ruben Grandia; David M\"uller; Espen Knoop; Moritz B\"acher

arXiv:2505.23708·cs.RO·May 30, 2025

AMOR: Adaptive Character Control through Multi-Objective Reinforcement Learning

Lucas N. Alegre, Agon Serifi, Ruben Grandia, David M\"uller, Espen Knoop, Moritz B\"acher

PDF

TL;DR

This paper introduces a multi-objective reinforcement learning framework that trains a single, weight-conditioned policy capable of generating diverse behaviors and adapting to new tasks efficiently, reducing tuning time and improving robotic motion control.

Contribution

The authors propose a novel multi-objective RL approach that conditions policies on reward weights, enabling post-training tuning and dynamic behavior adaptation for physics-based characters and robots.

Findings

01

Policy conditioned on weights spans Pareto front of behaviors.

02

Post-training weight tuning accelerates behavior optimization.

03

Hierarchical weight selection improves task-specific motion control.

Abstract

Reinforcement learning (RL) has significantly advanced the control of physics-based and robotic characters that track kinematic reference motion. However, methods typically rely on a weighted sum of conflicting reward functions, requiring extensive tuning to achieve a desired behavior. Due to the computational cost of RL, this iterative process is a tedious, time-intensive task. Furthermore, for robotics applications, the weights need to be chosen such that the policy performs well in the real world, despite inevitable sim-to-real gaps. To address these challenges, we propose a multi-objective reinforcement learning framework that trains a single policy conditioned on a set of weights, spanning the Pareto front of reward trade-offs. Within this framework, weights can be selected and tuned after training, significantly speeding up iteration time. We demonstrate how this improved workflow…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.