Generalizing from References using a Multi-Task Reference and Goal-Driven RL Framework
Jiashun Wang, M. Eva Mungai, He Li, Jean Pierre Sleiman, Jessica Hodgins, Farbod Farshidian

TL;DR
This paper presents a unified multi-task reinforcement learning framework that enables humanoid robots to learn natural, adaptable behaviors from reference motions and generalize to new goals without sacrificing motion quality.
Contribution
It introduces a novel multi-task RL approach that combines reference-guided imitation with goal-conditioned learning, improving motion naturalness and adaptability in humanoid control.
Findings
Policy transfers beyond reference distribution
Achieves diverse athletic behaviors like jumping and climbing
Demonstrates long-horizon skill composition
Abstract
Learning agile humanoid behaviors from human motion offers a powerful route to natural, coordinated control, but existing approaches face a persistent trade-off: reference-tracking policies are often brittle outside the demonstration dataset, while purely task-driven Reinforcement Learning (RL) can achieve adaptability at the cost of motion quality. We introduce a unified multi-task RL framework that bridges this gap by treating reference motion as a prior for behavioral shaping rather than a deployment-time constraint. A single goal-conditioned policy is trained jointly on two tasks that share the same observation and action spaces, but differ in their initialization schemes, command spaces, and reward structures: (i) a reference-guided imitation task in which reference trajectories define dense imitation rewards but are not provided as policy inputs, and (ii) a goal-conditioned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Human Pose and Action Recognition · Reinforcement Learning in Robotics
