HumanPlus: Humanoid Shadowing and Imitation from Humans
Zipeng Fu, Qingqing Zhao, Qi Wu, Gordon Wetzstein, Chelsea Finn

TL;DR
HumanPlus presents a comprehensive system enabling humanoid robots to learn and imitate human motions and skills from visual data, combining simulation training, real-time shadowing, and autonomous behavior cloning.
Contribution
The paper introduces a full-stack approach for humanoids to learn from human data, including real-time shadowing and egocentric vision-based skill learning, addressing perception and control challenges.
Findings
Humanoids successfully shadow human motions in real time using RGB cameras.
Humanoids autonomously perform tasks with 60-100% success rates.
System demonstrates effective imitation of diverse human skills.
Abstract
One of the key arguments for building robots that have similar form factors to human beings is that we can leverage the massive human data for training. Yet, doing so has remained challenging in practice due to the complexities in humanoid perception and control, lingering physical gaps between humanoids and humans in morphologies and actuation, and lack of a data pipeline for humanoids to learn autonomous skills from egocentric vision. In this paper, we introduce a full-stack system for humanoids to learn motion and autonomous skills from human data. We first train a low-level policy in simulation via reinforcement learning using existing 40-hour human motion datasets. This policy transfers to the real world and allows humanoid robots to follow human body and hand motion in real time using only a RGB camera, i.e. shadowing. Through shadowing, human operators can teleoperate humanoids…
Peer Reviews
Decision·CoRL 2024
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition
