Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images
Roozbeh Mottaghi, Hessam Bagherinezhad, Mohammad Rastegari, Ali, Farhadi

TL;DR
This paper introduces a neural network model that predicts the physical dynamics of objects in static images by mapping them to Newtonian scenarios, enabling physical reasoning about forces and motion.
Contribution
The paper presents the Newtonian Neural Network ($N^3$), a novel approach that links static images to physical dynamics through intermediate Newtonian scenarios, and introduces the VIND dataset for this task.
Findings
Successfully predicts object dynamics from single images.
Provides physical reasoning with velocity and force vectors.
Creates a new dataset for Newtonian dynamics in images.
Abstract
In this paper, we study the challenging problem of predicting the dynamics of objects in static images. Given a query object in an image, our goal is to provide a physical understanding of the object in terms of the forces acting upon it and its long term motion as response to those forces. Direct and explicit estimation of the forces and the motion of objects from a single image is extremely challenging. We define intermediate physical abstractions called Newtonian scenarios and introduce Newtonian Neural Network () that learns to map a single image to a state in a Newtonian scenario. Our experimental evaluations show that our method can reliably predict dynamics of a query object from a single image. In addition, our approach can provide physical reasoning that supports the predicted dynamics in terms of velocity and force vectors. To spur research in this direction we compiled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
