Neural probabilistic motor primitives for humanoid control
Josh Merel, Leonard Hasenclever, Alexandre Galashov, Arun Ahuja, Vu, Pham, Greg Wayne, Yee Whye Teh, Nicolas Heess

TL;DR
This paper introduces a neural probabilistic motor primitive system that learns a compact, flexible motor module for humanoid control, capable of one-shot imitation and task reuse, trained entirely offline from expert policies.
Contribution
It proposes a novel neural architecture with a latent-variable bottleneck for learning a universal motor primitive space from expert policies, enabling flexible humanoid control.
Findings
Successfully compresses thousands of expert policies into a single model.
Achieves one-shot imitation of unseen humanoid behaviors.
Facilitates training controllers to reuse learned primitives for new tasks.
Abstract
We focus on the problem of learning a single motor module that can flexibly express a range of behaviors for the control of high-dimensional physically simulated humanoids. To do this, we propose a motor architecture that has the general structure of an inverse model with a latent-variable bottleneck. We show that it is possible to train this model entirely offline to compress thousands of expert policies and learn a motor primitive embedding space. The trained neural probabilistic motor primitive system can perform one-shot imitation of whole-body humanoid behaviors, robustly mimicking unseen trajectories. Additionally, we demonstrate that it is also straightforward to train controllers to reuse the learned motor primitive space to solve tasks, and the resulting movements are relatively naturalistic. To support the training of our model, we compare two approaches for offline policy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMotor Control and Adaptation · Action Observation and Synchronization
