Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis

Hongyuan Chen; Xingyu Chen; Youjia Zhang; Zexiang Xu; Anpei Chen

arXiv:2601.14253·cs.CV·January 21, 2026

Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis

Hongyuan Chen, Xingyu Chen, Youjia Zhang, Zexiang Xu, Anpei Chen

PDF

Open Access 1 Models

TL;DR

Motion 3-to-4 introduces a novel framework that synthesizes high-quality 4D dynamic objects from a single monocular video by decomposing the process into shape and motion, utilizing a transformer for temporal coherence.

Contribution

The paper presents a new approach for 4D synthesis from monocular videos, combining static shape generation with motion reconstruction using a canonical mesh and a scalable transformer.

Findings

01

Outperforms prior methods in fidelity and spatial consistency

02

Effective on standard benchmarks and a new ground-truth dataset

03

Robust to varying sequence lengths

Abstract

We present Motion 3-to-4, a feed-forward framework for synthesising high-quality 4D dynamic objects from a single monocular video and an optional 3D reference mesh. While recent advances have significantly improved 2D, video, and 3D content generation, 4D synthesis remains difficult due to limited training data and the inherent ambiguity of recovering geometry and motion from a monocular viewpoint. Motion 3-to-4 addresses these challenges by decomposing 4D synthesis into static 3D shape generation and motion reconstruction. Using a canonical reference mesh, our model learns a compact motion latent representation and predicts per-frame vertex trajectories to recover complete, temporally coherent geometry. A scalable frame-wise transformer further enables robustness to varying sequence lengths. Evaluations on both standard benchmarks and a new dataset with accurate ground-truth geometry…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
River-Chen/Motion324
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis · Human Motion and Animation