FastInit: Fast Noise Initialization for Temporally Consistent Video Generation

Chengyu Bai; Yuming Li; Zhongyu Zhao; Jintao Chen; Peidong Jia; Qi She; Ming Lu; Shanghang Zhang

arXiv:2506.16119·cs.CV·August 6, 2025

FastInit: Fast Noise Initialization for Temporally Consistent Video Generation

Chengyu Bai, Yuming Li, Zhongyu Zhao, Jintao Chen, Peidong Jia, Qi She, Ming Lu, Shanghang Zhang

PDF

Open Access

TL;DR

FastInit introduces a single-pass noise prediction method that significantly improves the efficiency and temporal consistency of text-to-video generation without iterative refinement.

Contribution

We propose FastInit, a novel fast noise initialization technique with a trained VNPNet that enhances video quality and consistency during inference.

Findings

01

Improves temporal consistency of generated videos

02

Reduces computational cost compared to iterative methods

03

Enhances quality of text-to-video generation

Abstract

Video generation has made significant strides with the development of diffusion models; however, achieving high temporal consistency remains a challenging task. Recently, FreeInit identified a training-inference gap and introduced a method to iteratively refine the initial noise during inference. However, iterative refinement significantly increases the computational cost associated with video generation. In this paper, we introduce FastInit, a fast noise initialization method that eliminates the need for iterative refinement. FastInit learns a Video Noise Prediction Network (VNPNet) that takes random noise and a text prompt as input, generating refined noise in a single forward pass. Therefore, FastInit greatly enhances the efficiency of video generation while achieving high temporal consistency across frames. To train the VNPNet, we create a large-scale dataset consisting of pairs of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Model Reduction and Neural Networks