Imitation Learning via Differentiable Physics
Siwei Chen, Xiao Ma, Zhongwen Xu

TL;DR
This paper introduces ILD, a novel imitation learning method that leverages differentiable physics simulators to improve training efficiency, stability, and transferability in continuous control and deformable object tasks.
Contribution
ILD eliminates the double-loop training of traditional IL methods by integrating differentiable physics into a single-loop, enhancing performance and generalization.
Findings
Outperforms state-of-the-art methods in continuous control tasks
Requires only one expert demonstration
Successfully applied to deformable object manipulation
Abstract
Existing imitation learning (IL) methods such as inverse reinforcement learning (IRL) usually have a double-loop training process, alternating between learning a reward function and a policy and tend to suffer long training time and high variance. In this work, we identify the benefits of differentiable physics simulators and propose a new IL method, i.e., Imitation Learning via Differentiable Physics (ILD), which gets rid of the double-loop design and achieves significant improvements in final performance, convergence speed, and stability. The proposed ILD incorporates the differentiable physics simulator as a physics prior into its computational graph for policy learning. It unrolls the dynamics by sampling actions from a parameterized policy, simply minimizing the distance between the expert trajectory and the agent trajectory, and back-propagating the gradient into the policy via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning
