SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time

Zhening Huang; Hyeonho Jeong; Xuelin Chen; Yulia Gryaditskaya; Tuanfeng Y. Wang; Joan Lasenby; and Chun-Hao Huang

arXiv:2512.25075·cs.CV·January 1, 2026

SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time

Zhening Huang, Hyeonho Jeong, Xuelin Chen, Yulia Gryaditskaya, Tuanfeng Y. Wang, Joan Lasenby, and Chun-Hao Huang

PDF

Open Access 1 Models

TL;DR

SpaceTimePilot introduces a novel generative model that independently controls space and time in dynamic scene rendering, enabling continuous exploration and manipulation of videos through a diffusion process with specialized training and datasets.

Contribution

The paper presents a new diffusion-based model with explicit space-time control, a temporal-warping training scheme, and a synthetic dataset, advancing controllable video generation from monocular inputs.

Findings

01

Effective space-time disentanglement demonstrated on real and synthetic data

02

Improved control over camera viewpoint and motion sequences

03

Superior results compared to prior generative video models

Abstract

We present SpaceTimePilot, a video diffusion model that disentangles space and time for controllable generative rendering. Given a monocular video, SpaceTimePilot can independently alter the camera viewpoint and the motion sequence within the generative process, re-rendering the scene for continuous and arbitrary exploration across space and time. To achieve this, we introduce an effective animation time-embedding mechanism in the diffusion process, allowing explicit control of the output video's motion sequence with respect to that of the source video. As no datasets provide paired videos of the same dynamic scene with continuous temporal variations, we propose a simple yet effective temporal-warping training scheme that repurposes existing multi-view datasets to mimic temporal differences. This strategy effectively supervises the model to learn temporal control and achieve robust…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
zhening/SpaceTimePilot
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · Computer Graphics and Visualization Techniques