IntPhys 2: Benchmarking Intuitive Physics Understanding In Complex Synthetic Environments

Florian Bordes; Quentin Garrido; Justine T Kao; Adina Williams; Michael Rabbat; Emmanuel Dupoux

arXiv:2506.09849·cs.CV·June 12, 2025

IntPhys 2: Benchmarking Intuitive Physics Understanding In Complex Synthetic Environments

Florian Bordes, Quentin Garrido, Justine T Kao, Adina Williams, Michael Rabbat, Emmanuel Dupoux

PDF

Open Access 1 Repo 3 Datasets

TL;DR

IntPhys 2 is a comprehensive video benchmark designed to evaluate deep learning models' understanding of core intuitive physics principles in complex synthetic environments, revealing significant gaps compared to human performance.

Contribution

The paper introduces IntPhys 2, a new benchmark focusing on four physics principles, and evaluates state-of-the-art models, highlighting their limitations in understanding intuitive physics.

Findings

01

Models perform at chance level on the benchmark.

02

Human performance is near-perfect, indicating a gap in current models.

03

The benchmark challenges models to differentiate possible and impossible events.

Abstract

We present IntPhys 2, a video benchmark designed to evaluate the intuitive physics understanding of deep learning models. Building on the original IntPhys benchmark, IntPhys 2 focuses on four core principles related to macroscopic objects: Permanence, Immutability, Spatio-Temporal Continuity, and Solidity. These conditions are inspired by research into intuitive physical understanding emerging during early childhood. IntPhys 2 offers a comprehensive suite of tests, based on the violation of expectation framework, that challenge models to differentiate between possible and impossible events within controlled and diverse virtual environments. Alongside the benchmark, we provide performance evaluations of several state-of-the-art models. Our findings indicate that while these models demonstrate basic visual understanding, they face significant challenges in grasping intuitive physics…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

facebookresearch/intphys2
pytorchOfficial

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications