PIP: Physical Interaction Prediction via Mental Simulation with Span Selection
Jiafei Duan, Samson Yu, Soujanya Poria, Bihan Wen, Cheston Tan

TL;DR
This paper introduces PIP, a novel model that predicts outcomes of complex physical interactions by simulating future frames and selectively focusing on key moments, outperforming existing models and humans.
Contribution
PIP is the first model to combine mental simulation with span selection for long, multi-object physical interaction prediction, enhancing accuracy and interpretability.
Findings
PIP outperforms baseline and human predictions on the SPACE+ dataset.
Span selection effectively identifies key interaction frames.
PIP achieves superior accuracy in long sequence physical predictions.
Abstract
Accurate prediction of physical interaction outcomes is a crucial component of human intelligence and is important for safe and efficient deployments of robots in the real world. While there are existing vision-based intuitive physics models that learn to predict physical interaction outcomes, they mostly focus on generating short sequences of future frames based on physical properties (e.g. mass, friction and velocity) extracted from visual inputs or a latent space. However, there is a lack of intuitive physics models that are tested on long physical interaction sequences with multiple interactions among different objects. We hypothesize that selective temporal attention during approximate mental simulations helps humans in physical interaction outcome prediction. With these motivations, we propose a novel scheme: Physical Interaction Prediction via Mental Simulation with Span…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Generative Adversarial Networks and Image Synthesis · Data Visualization and Analytics
