Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion
Yongjia Ma, Junlin Chen, Donglin Di, Qi Xie, Lei Fan, Wei Chen,, Xiaofei Gou, Na Zhao, Xun Yang

TL;DR
This paper introduces GLC-Diffusion, a tuning-free approach for generating long, high-quality videos with improved temporal coherence and content consistency, using global-local denoising and a novel refinement module.
Contribution
It presents a novel tuning-free long video generation method combining global-local collaborative denoising, noise reinitialization, and motion consistency refinement.
Findings
Produces more coherent long videos than previous methods.
Effectively integrates with existing diffusion models.
Achieves superior visual quality and temporal smoothness.
Abstract
Creating high-fidelity, coherent long videos is a sought-after aspiration. While recent video diffusion models have shown promising potential, they still grapple with spatiotemporal inconsistencies and high computational resource demands. We propose GLC-Diffusion, a tuning-free method for long video generation. It models the long video denoising process by establishing denoising trajectories through Global-Local Collaborative Denoising to ensure overall content consistency and temporal coherence between frames. Additionally, we introduce a Noise Reinitialization strategy which combines local noise shuffling with frequency fusion to improve global content consistency and visual diversity. Further, we propose a Video Motion Consistency Refinement (VMCR) module that computes the gradient of pixel-wise and frequency-wise losses to enhance visual consistency and temporal smoothness.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Coding and Compression Technologies · Image and Video Quality Assessment · Multimedia Communication and Technology
MethodsDiffusion
