MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE

Ruijie Zhu; Jiahao Lu; Wenbo Hu; Xiaoguang Han; Jianfei Cai; Ying Shan; Chuanxia Zheng

arXiv:2602.08961 · cs.CV · March 31, 2026

TL;DR

MotionCrafter introduces a novel framework using a 4D VAE to jointly reconstruct 4D geometry and dense motion from monocular videos, outperforming previous methods without post-optimization.

Contribution

It proposes a new joint representation and training strategy for 4D VAE that improves geometry and motion reconstruction quality from monocular videos.

Findings

01. Achieves 38.64% improvement in geometry reconstruction
02. Achieves 25.0% improvement in motion reconstruction
03. Outperforms prior methods on multiple datasets

Abstract

We present MotionCrafter, a framework that leverages video generators to jointly reconstruct 4D geometry and estimate dense motion from a monocular video. The key idea is a joint representation of dense 3D point maps and 3D scene flows in a shared coordinate system, together with a 4D VAE tailored to learn this representation effectively. Unlike prior work that strictly aligns 3D values and latents with RGB VAE latents, despite their fundamentally different distributions, we show that such alignment is unnecessary and can hurt performance. Instead, we propose a new data normalization and VAE training strategy that better transfers diffusion priors and greatly improves reconstruction quality. Extensive experiments on multiple datasets show that MotionCrafter achieves state-of-the-art performance in both geometry reconstruction and dense scene flow estimation, delivering 38.64% and 25.0%…
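To make the joint representation in the abstract more concrete, here is a minimal NumPy sketch of one way to store per-pixel 3D points and per-pixel 3D scene flow in a shared coordinate system. It is not the authors' implementation: the function names, the `(T, H, W, 3)` array layout, and in particular the normalization (centering on the mean point and dividing by the mean distance) are assumptions chosen for illustration. The abstract does not describe the paper's actual normalization scheme.

```python
import numpy as np


def joint_representation(point_maps, scene_flows):
    """Stack 3D point maps and 3D scene flows into one 6-channel array.

    Both inputs are assumed to have shape (T, H, W, 3) and to be expressed
    in the same (shared) coordinate system. This layout is hypothetical.
    """
    if point_maps.shape != scene_flows.shape or point_maps.shape[-1] != 3:
        raise ValueError("expected two arrays of identical shape (T, H, W, 3)")
    return np.concatenate([point_maps, scene_flows], axis=-1)


def advect(point_maps, scene_flows):
    """Move every 3D point by its flow vector.

    Because points and flows share a coordinate system, a plain addition
    gives the position of each pixel's 3D point at the next time step.
    """
    return point_maps + scene_flows


def normalize_jointly(joint, eps=1e-8):
    """Illustrative normalization (an assumption, not the paper's scheme).

    Points are centered and scaled; flows are displacements, so they are
    only scaled. Using the same scale for both keeps them consistent.
    """
    points, flows = joint[..., :3], joint[..., 3:]
    center = points.reshape(-1, 3).mean(axis=0)
    scale = np.linalg.norm(points - center, axis=-1).mean() + eps
    normalized = np.concatenate([(points - center) / scale, flows / scale], axis=-1)
    return normalized, center, scale
```

A useful property of scaling points and flows together is that advecting in normalized space and then undoing the normalization gives the same result as advecting the original points directly.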

Code & Models

Models

TencentARC/MotionCrafter (Hugging Face model)
