Shape of Motion: 4D Reconstruction from a Single Video

Qianqian Wang; Vickie Ye; Hang Gao; Weijia Zeng; Jake Austin; Zhengqi Li; Angjoo Kanazawa

arXiv:2407.13764·cs.CV·October 17, 2025

Shape of Motion: 4D Reconstruction from a Single Video

Qianqian Wang, Vickie Ye, Hang Gao, Weijia Zeng, Jake Austin, Zhengqi Li, Angjoo Kanazawa

PDF

Open Access

TL;DR

This paper presents a novel method for 4D reconstruction of dynamic scenes from a single monocular video, explicitly modeling 3D motion trajectories using low-dimensional motion bases and integrating data-driven priors for improved accuracy.

Contribution

It introduces a new approach that explicitly models 3D motion trajectories with SE(3) bases and consolidates noisy priors for globally consistent dynamic scene reconstruction.

Findings

01

Achieves state-of-the-art results in 3D/2D motion estimation

02

Effective long-range motion tracking in dynamic scenes

03

Improves novel view synthesis quality

Abstract

Monocular dynamic reconstruction is a challenging and long-standing vision problem due to the highly ill-posed nature of the task. Existing approaches depend on templates, are effective only in quasi-static scenes, or fail to model 3D motion explicitly. We introduce a method for reconstructing generic dynamic scenes, featuring explicit, persistent 3D motion trajectories in the world coordinate frame, from casually captured monocular videos. We tackle the problem with two key insights: First, we exploit the low-dimensional structure of 3D motion by representing scene motion with a compact set of SE(3) motion bases. Each point's motion is expressed as a linear combination of these bases, facilitating soft decomposition of the scene into multiple rigidly-moving groups. Second, we take advantage of off-the-shelf data-driven priors such as monocular depth maps and long-range 2D tracks, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage · Advanced Vision and Imaging

MethodsSparse Evolutionary Training