Visual Imitation Learning of Non-Prehensile Manipulation Tasks with Dynamics-Supervised Models
Abdullah Mustafa, Ryo Hanai, Ixchel Ramirez, Floris Erich, Ryoichi, Nakajo, Yukiyasu Domae, Tetsuya Ogata

TL;DR
This paper introduces a dynamics-supervised world model for visual imitation learning in non-prehensile manipulation, significantly improving task success rates by directly predicting dynamic states alongside RGB images.
Contribution
It proposes a novel dynamics mapping approach that enhances world models with direct supervision of dynamic states, improving generalization and performance in dynamic manipulation tasks.
Findings
Dynamics mapping increased success rate from 21% to 85%.
World models with dynamics supervision generalized better within domain.
Frozen dynamics-informed models struggled with out-of-domain dynamics.
Abstract
Unlike quasi-static robotic manipulation tasks like pick-and-place, dynamic tasks such as non-prehensile manipulation pose greater challenges, especially for vision-based control. Successful control requires the extraction of features relevant to the target task. In visual imitation learning settings, these features can be learnt by backpropagating the policy loss through the vision backbone. Yet, this approach tends to learn task-specific features with limited generalizability. Alternatively, learning world models can realize more generalizable vision backbones. Utilizing the learnt features, task-specific policies are subsequently trained. Commonly, these models are trained solely to predict the next RGB state from the current state and action taken. But only-RGB prediction might not fully-capture the task-relevant dynamics. In this work, we hypothesize that direct supervision of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Teleoperation and Haptic Systems · Robotic Mechanisms and Dynamics
