FreqPrior: Improving Video Diffusion Models with Frequency Filtering   Gaussian Noise

Yunlong Yuan; Yuanfan Guo; Chunwei Wang; Wei Zhang; Hang Xu; Li Zhang

arXiv:2502.03496·eess.IV·February 20, 2025

FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise

Yunlong Yuan, Yuanfan Guo, Chunwei Wang, Wei Zhang, Hang Xu, Li Zhang

PDF

Open Access 1 Video

TL;DR

FreqPrior introduces a frequency domain filtering technique for noise initialization in video diffusion models, significantly enhancing generation quality and efficiency by preserving motion and details.

Contribution

The paper presents FreqPrior, a novel frequency filtering approach for noise in diffusion models, improving video quality and reducing inference time.

Findings

01

Achieves highest quality and semantic scores on VBench

02

Reduces inference time with partial sampling

03

Enhances motion dynamics and detail preservation

Abstract

Text-driven video generation has advanced significantly due to developments in diffusion models. Beyond the training and sampling phases, recent studies have investigated noise priors of diffusion models, as improved noise priors yield better generation results. One recent approach employs the Fourier transform to manipulate noise, marking the initial exploration of frequency operations in this context. However, it often generates videos that lack motion dynamics and imaging details. In this work, we provide a comprehensive theoretical analysis of the variance decay issue present in existing methods, contributing to the loss of details and motion dynamics. Recognizing the critical impact of noise distribution on generation quality, we introduce FreqPrior, a novel noise initialization strategy that refines noise in the frequency domain. Our method features a novel filtering technique…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise· slideslive

Taxonomy

TopicsImage and Signal Denoising Methods

MethodsDiffusion