Imitation Learning via Differentiable Physics

Siwei Chen; Xiao Ma; Zhongwen Xu

arXiv:2206.04873·cs.LG·June 13, 2022·1 cites

Imitation Learning via Differentiable Physics

Siwei Chen, Xiao Ma, Zhongwen Xu

PDF

Open Access 1 Repo

TL;DR

This paper introduces ILD, a novel imitation learning method that leverages differentiable physics simulators to improve training efficiency, stability, and transferability in continuous control and deformable object tasks.

Contribution

ILD eliminates the double-loop training of traditional IL methods by integrating differentiable physics into a single-loop, enhancing performance and generalization.

Findings

01

Outperforms state-of-the-art methods in continuous control tasks

02

Requires only one expert demonstration

03

Successfully applied to deformable object manipulation

Abstract

Existing imitation learning (IL) methods such as inverse reinforcement learning (IRL) usually have a double-loop training process, alternating between learning a reward function and a policy and tend to suffer long training time and high variance. In this work, we identify the benefits of differentiable physics simulators and propose a new IL method, i.e., Imitation Learning via Differentiable Physics (ILD), which gets rid of the double-loop design and achieves significant improvements in final performance, convergence speed, and stability. The proposed ILD incorporates the differentiable physics simulator as a physics prior into its computational graph for policy learning. It unrolls the dynamics by sampling actions from a parameterized policy, simply minimizing the distance between the expert trajectory and the agent trajectory, and back-propagating the gradient into the policy via…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sail-sg/ild
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning