Infinite-Resolution Integral Noise Warping for Diffusion Models
Yitong Deng, Winnie Lin, Lingxiao Li, Dmitriy Smirnov, Ryan Burgert,, Ning Yu, Vincent Dedun, Mohammad H. Taghavi

TL;DR
This paper introduces an efficient infinite-resolution noise warping algorithm for diffusion models, improving temporal consistency in video generation while reducing computational costs significantly.
Contribution
It develops a novel algorithm based on Brownian bridges that achieves infinite-resolution accuracy with lower computational complexity compared to previous methods.
Findings
The new algorithm matches the accuracy of high-resolution methods.
It reduces computational costs by orders of magnitude.
Effective in real-world video generation applications.
Abstract
Adapting pretrained image-based diffusion models to generate temporally consistent videos has become an impactful generative modeling research direction. Training-free noise-space manipulation has proven to be an effective technique, where the challenge is to preserve the Gaussian white noise distribution while adding in temporal consistency. Recently, Chang et al. (2024) formulated this problem using an integral noise representation with distribution-preserving guarantees, and proposed an upsampling-based algorithm to compute it. However, while their mathematical formulation is advantageous, the algorithm incurs a high computational cost. Through analyzing the limiting-case behavior of their algorithm as the upsampling resolution goes to infinity, we develop an alternative algorithm that, by gathering increments of multiple Brownian bridges, achieves their infinite-resolution accuracy…
Peer Reviews
Decision·ICLR 2025 Poster
1) The paper is very well written and the ideas are presented in a natural way. 2) The method is significantly faster compared to the baseline. 3) The fuzzy variant of the method is very clever: significant computation time is saved and the method is more robust as evaluated in real-world applications. 4) The ideas presented are sound and the Euclidean View offers an alternative way to think about the problem of noise warping that I enjoyed reading about.
1) The biggest weakness of this paper is that the experimental results are very weak. I downloaded the supplementary material and looked at the videos; the quality is unsatisfactory. While the proposed method offers more consistency than the existing baselines, it is still very inconsistent. The extent of inconsistency is so high, that I can't think of any applications that would leverage this method. 2) At the same time, I am not very confident that the considered approach is meaningful. It i
- The new interpretation/formulation of integral noise warping is novel and leads to theoretically backed relation to Brownian bridges, whose Markovian property allows to derive convenient implementation methods. - The derived methods are useful in practical applications where image-based diffusion models are used for video generation in a training-free manner. - The paper is well-written.
- There is still inconsistency in the details in the frames. - The quantitative SDEdit-based video results of the method (and HIWYN) are similar to the competitors.
- By using Brownian bridge increments instead of high-resolution upsampling, the proposed method achieves efficient noise manipulation without sacrificing accuracy, greatly enhancing speed and memory efficiency. - The method maintains temporal consistency of noise in video generation, avoiding flickering or inconsistencies between frames seen in traditional methods, resulting in smoother and more natural video outputs. - The particle-based noise overlap calculation method is not only faster but
See Questions.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAcoustic Wave Phenomena Research
MethodsDiffusion
