Orthogonal Spatial-temporal Distributional Transfer for 4D Generation

Wei Liu; Shengqiong Wu; Bobo Li; Haoyu Zhao; Hao Fei; Mong-Li Lee; Wynne Hsu

arXiv:2603.05081·cs.CV·March 6, 2026

Orthogonal Spatial-temporal Distributional Transfer for 4D Generation

Wei Liu, Shengqiong Wu, Bobo Li, Haoyu Zhao, Hao Fei, Mong-Li Lee, Wynne Hsu

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel framework for 4D content generation that leverages spatial priors from 3D models and temporal priors from video models, overcoming dataset limitations to produce high-quality, consistent 4D synthesis.

Contribution

It proposes a new orthogonal transfer mechanism and a disentangled diffusion model to enhance 4D synthesis by effectively transferring spatial and temporal features.

Findings

01

Outperforms existing methods in 4D synthesis quality

02

Achieves superior spatial-temporal consistency

03

Demonstrates effective feature transfer across domains

Abstract

In the AIGC era, generating high-quality 4D content has garnered increasing research attention. Unfortunately, current 4D synthesis research is severely constrained by the lack of large-scale 4D datasets, preventing models from adequately learning the critical spatial-temporal features necessary for high-quality 4D generation, thus hindering progress in this domain. To combat this, we propose a novel framework that transfers rich spatial priors from existing 3D diffusion models and temporal priors from video diffusion models to enhance 4D synthesis. We develop a spatial-temporal-disentangled 4D (STD-4D) Diffusion model, which synthesizes 4D-aware videos through disentangled spatial and temporal latents. To facilitate the best feature transfer, we design a novel Orthogonal Spatial-temporal Distributional Transfer (Orster) mechanism, where the spatiotemporal feature distributions are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Orthogonal Spatial-temporal Distributional Transfer for 4D Generation· underline

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Advanced Vision and Imaging