Physical Reasoning Using Dynamics-Aware Models
Eltayeb Ahmed, Anton Bakhtin, Laurens van der Maaten, Rohit Girdhar

TL;DR
This paper introduces a dynamics-aware modeling approach that enhances physical reasoning by incorporating self-supervised signals about object trajectories, leading to improved performance on the PHYRE benchmark.
Contribution
It proposes augmenting reward signals with self-supervised object dynamics to improve physical reasoning models, achieving state-of-the-art results.
Findings
Significant performance improvements on PHYRE benchmark
Self-supervised signals enhance physical reasoning accuracy
Trajectory similarity measures improve model training
Abstract
A common approach to solving physical reasoning tasks is to train a value learner on example tasks. A limitation of such an approach is that it requires learning about object dynamics solely from reward values assigned to the final state of a rollout of the environment. This study aims to address this limitation by augmenting the reward value with self-supervised signals about object dynamics. Specifically, we train the model to characterize the similarity of two environment rollouts, jointly with predicting the outcome of the reasoning task. This similarity can be defined as a distance measure between the trajectory of objects in the two rollouts, or learned directly from pixels using a contrastive formulation. Empirically, we find that this approach leads to substantial performance improvements on the PHYRE benchmark for physical reasoning (Bakhtin et al., 2019), establishing a new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Multimodal Machine Learning Applications
