Dynamic Video Generation: Shaping Video Generation Across Time and Space

Shikang Zheng; Jingkai Huang; Jiacheng Liu; Guantao Chen; Lixuan; Yuqi Lin; Peiliang Cai; Linfeng Zhang

arXiv:2605.21042·cs.CV·May 21, 2026

Dynamic Video Generation: Shaping Video Generation Across Time and Space

Shikang Zheng, Jingkai Huang, Jiacheng Liu, Guantao Chen, Lixuan, Yuqi Lin, Peiliang Cai, Linfeng Zhang

PDF

TL;DR

DVG is a dynamic framework for video generation that adaptively accelerates diffusion models across space and time, achieving significant speedups with minimal quality loss.

Contribution

It introduces a content-aware, automatic computation allocation method for efficient video diffusion, scalable across models and tasks without manual tuning.

Findings

01

Up to 7x speedup on HunyuanVideo models

02

18x acceleration with distillation

03

Near-lossless acceleration across diverse tasks

Abstract

Diffusion models have achieved impressive performance in video generation, but their iterative denoising process remains computationally expensive due to the large number of tokens processed at each timestep. Recently, progressive resolution sampling has emerged as a promising acceleration approach by reducing latent resolution in early stages. However, scaling this idea to video generation remains challenging, as the additional temporal dimension introduces diverse spatio-temporal demands across different videos, and compressing only a single dimension often leads to limited acceleration or degraded quality. Therefore, we propose DVG, a Dynamic Video Generation framework that jointly allocates computation across time and space, automatically selecting content-aware acceleration strategies without manual tuning or retraining. DVG achieves near-lossless acceleration across models and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.