"What happens if..." Learning to Predict the Effect of Forces in Images
Roozbeh Mottaghi, Mohammad Rastegari, Abhinav Gupta, Ali Farhadi

TL;DR
This paper introduces a deep learning approach to predict long-term object movements in images caused by external forces, utilizing a new dataset created through physics simulation of scene images.
Contribution
It presents a novel neural network model that reasons about scene geometry and object attributes to predict object movements, along with a large-scale dataset for training.
Findings
Model successfully predicts object movements from a single image.
The dataset contains over 65,000 object movements in 10,335 images.
Deep learning approach outperforms baseline methods.
Abstract
What happens if one pushes a cup sitting on a table toward the edge of the table? How about pushing a desk against a wall? In this paper, we study the problem of understanding the movements of objects as a result of applying external forces to them. For a given force vector applied to a specific location in an image, our goal is to predict long-term sequential movements caused by that force. Doing so entails reasoning about scene geometry, objects, their attributes, and the physical rules that govern the movements of objects. We design a deep neural network model that learns long-term sequential dependencies of object movements while taking into account the geometry and appearance of the scene by combining Convolutional and Recurrent Neural Networks. Training our model requires a large-scale dataset of object movements caused by external forces. To build a dataset of forces in scenes,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Advanced Neural Network Applications
