Taking Visual Motion Prediction To New Heightfields
Sebastien Ehrhardt, Aron Monszpart, Niloy Mitra, Andrea, Vedaldi

TL;DR
This paper presents a recurrent neural network that learns to predict the motion of objects on complex heightfields solely from visual data, enabling long-term physical extrapolation without explicit physical state modeling.
Contribution
The work introduces a novel neural network architecture that implicitly models physical states from images, improving prediction accuracy in complex real-world scenarios.
Findings
Significant improvement over existing image-based methods.
Effective long-term physical extrapolation on arbitrary heightfields.
Competitive performance with methods using explicit physical states.
Abstract
While the basic laws of Newtonian mechanics are well understood, explaining a physical scenario still requires manually modeling the problem with suitable equations and estimating the associated parameters. In order to be able to leverage the approximation capabilities of artificial intelligence techniques in such physics related contexts, researchers have handcrafted the relevant states, and then used neural networks to learn the state transitions using simulation runs as training data. Unfortunately, such approaches are unsuited for modeling complex real-world scenarios, where manually authoring relevant state spaces tend to be tedious and challenging. In this work, we investigate if neural networks can implicitly learn physical states of real-world mechanical processes only based on visual data while internally modeling non-homogeneous environment and in the process enable long-term…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
