gradSim: Differentiable simulation for system identification and visuomotor control
Krishna Murthy Jatavallabhula, Miles Macklin, Florian Golemo and, Vikram Voleti, Linda Petrini, Martin Weiss, Breandan Considine and, Jerome Parent-Levesque, Kevin Xie, Kenny Erleben, Liam Paull and, Florian Shkurti, Derek Nowrouzezahrai, Sanja Fidler

TL;DR
gradSim introduces a differentiable simulation framework that jointly models scene dynamics and image formation, enabling physical property estimation and visuomotor control directly from videos without requiring 3D labels.
Contribution
It combines differentiable multiphysics simulation and rendering to enable end-to-end learning of physical properties from videos without 3D supervision.
Findings
Achieves competitive or superior performance to 3D-supervised methods.
Enables learning physical properties for deformable objects and cloth.
Supports visuomotor control tasks without state-based supervision.
Abstract
We consider the problem of estimating an object's physical properties such as mass, friction, and elasticity directly from video sequences. Such a system identification problem is fundamentally ill-posed due to the loss of information during image formation. Current solutions require precise 3D labels which are labor-intensive to gather, and infeasible to create for many systems such as deformable solids or cloth. We present gradSim, a framework that overcomes the dependence on 3D supervision by leveraging differentiable multiphysics simulation and differentiable rendering to jointly model the evolution of scene dynamics and image formation. This novel combination enables backpropagation from pixels in a video sequence through to the underlying physical attributes that generated them. Moreover, our unified computation graph -- spanning from the dynamics and through the rendering process…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Human Motion and Animation
