FreeGave: 3D Physics Learning from Dynamic Videos by Gaussian Velocity

Jinxi Li; Ziyang Song; Siyuan Zhou; Bo Yang

arXiv:2506.07865·cs.CV·June 10, 2025

FreeGave: 3D Physics Learning from Dynamic Videos by Gaussian Velocity

Jinxi Li, Ziyang Song, Siyuan Zhou, Bo Yang

PDF

Open Access 1 Repo

TL;DR

FreeGave is a novel method that learns 3D scene physics directly from multi-view videos without object priors, using a Gaussian velocity field and divergence-free module, outperforming existing approaches.

Contribution

It introduces a physics code and divergence-free module to learn complex 3D physical motions from videos without object priors, avoiding PINN inefficiencies.

Findings

01

Outperforms existing methods in future frame extrapolation

02

Accurately segments motion in complex scenes

03

Learns meaningful physical motion patterns without labels

Abstract

In this paper, we aim to model 3D scene geometry, appearance, and the underlying physics purely from multi-view videos. By applying various governing PDEs as PINN losses or incorporating physics simulation into neural networks, existing works often fail to learn complex physical motions at boundaries or require object priors such as masks or types. In this paper, we propose FreeGave to learn the physics of complex dynamic 3D scenes without needing any object priors. The key to our approach is to introduce a physics code followed by a carefully designed divergence-free module for estimating a per-Gaussian velocity field, without relying on the inefficient PINN losses. Extensive experiments on three public datasets and a newly collected challenging real-world dataset demonstrate the superior performance of our method for future frame extrapolation and motion segmentation. Most notably,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vlar-group/freegave
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Generative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis