Learning to Identify Physical Parameters from Video Using Differentiable Physics
Rama Krishna Kandukuri, Jan Achterhold, Michael M\"oller, J\"org, St\"uckler

TL;DR
This paper introduces a method that combines differentiable physics engines with video representation learning to identify physical parameters like mass and friction from videos, enabling better physical understanding and future frame prediction.
Contribution
It presents a novel approach integrating differentiable physics with video encoding to learn physically interpretable latent representations, including supervised and self-supervised training methods.
Findings
Successfully learned physical properties from videos in simulated scenarios.
Achieved accurate future frame prediction using physical latent representations.
Outperformed baseline system identification methods in physical property estimation.
Abstract
Video representation learning has recently attracted attention in computer vision due to its applications for activity and scene forecasting or vision-based planning and control. Video prediction models often learn a latent representation of video which is encoded from input frames and decoded back into images. Even when conditioned on actions, purely deep learning based architectures typically lack a physically interpretable latent space. In this study, we use a differentiable physics engine within an action-conditional video representation network to learn a physical latent representation. We propose supervised and self-supervised learning methods to train our network and identify physical properties. The latter uses spatial transformers to decode physical states back into images. The simulation scenarios in our experiments comprise pushing, sliding and colliding objects, for which we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
