Generalizing from References using a Multi-Task Reference and Goal-Driven RL Framework

Jiashun Wang; M. Eva Mungai; He Li; Jean Pierre Sleiman; Jessica Hodgins; Farbod Farshidian

arXiv:2602.20375·cs.RO·February 25, 2026

Generalizing from References using a Multi-Task Reference and Goal-Driven RL Framework

Jiashun Wang, M. Eva Mungai, He Li, Jean Pierre Sleiman, Jessica Hodgins, Farbod Farshidian

PDF

Open Access

TL;DR

This paper presents a unified multi-task reinforcement learning framework that enables humanoid robots to learn natural, adaptable behaviors from reference motions and generalize to new goals without sacrificing motion quality.

Contribution

It introduces a novel multi-task RL approach that combines reference-guided imitation with goal-conditioned learning, improving motion naturalness and adaptability in humanoid control.

Findings

01

Policy transfers beyond reference distribution

02

Achieves diverse athletic behaviors like jumping and climbing

03

Demonstrates long-horizon skill composition

Abstract

Learning agile humanoid behaviors from human motion offers a powerful route to natural, coordinated control, but existing approaches face a persistent trade-off: reference-tracking policies are often brittle outside the demonstration dataset, while purely task-driven Reinforcement Learning (RL) can achieve adaptability at the cost of motion quality. We introduce a unified multi-task RL framework that bridges this gap by treating reference motion as a prior for behavioral shaping rather than a deployment-time constraint. A single goal-conditioned policy is trained jointly on two tasks that share the same observation and action spaces, but differ in their initialization schemes, command spaces, and reward structures: (i) a reference-guided imitation task in which reference trajectories define dense imitation rewards but are not provided as policy inputs, and (ii) a goal-conditioned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Human Pose and Action Recognition · Reinforcement Learning in Robotics