Warm Diffusion: Recipe for Blur-Noise Mixture Diffusion Models
Hao-Chien Hsueh, Chi-En Yen, Wen-Hsiao Peng, Ching-Chun Huang

TL;DR
Warm Diffusion introduces a unified model that combines blurring and noise in diffusion processes, leveraging spectral analysis to improve image generation by balancing randomness and correlation.
Contribution
This paper proposes the Warm Diffusion model, integrating hot and cold diffusion paradigms through spectral analysis for improved image generation.
Findings
Effective image generation across benchmarks
Spectral analysis clarifies the trade-off between noise and blur
Disentangling denoising and deblurring simplifies training
Abstract
Diffusion probabilistic models have achieved remarkable success in generative tasks across diverse data types. While recent studies have explored alternative degradation processes beyond Gaussian noise, this paper bridges two key diffusion paradigms: hot diffusion, which relies entirely on noise, and cold diffusion, which uses only blurring without noise. We argue that hot diffusion fails to exploit the strong correlation between high-frequency image detail and low-frequency structures, leading to random behaviors in the early steps of generation. Conversely, while cold diffusion leverages image correlations for prediction, it neglects the role of noise (randomness) in shaping the data manifold, resulting in out-of-manifold issues and partially explaining its performance drop. To integrate both strengths, we propose Warm Diffusion, a unified Blur-Noise Mixture Diffusion Model (BNMD), to…
Peer Reviews
Decision·ICLR 2025 Poster
The paper is well-written
1.An improvement of 1-2 points in FID does not result in any noticeable change in visual effects. In fact, the actual visual quality may not be better than those with lower FID scores. 2.I hope the authors can present results that are sufficiently stunning or impactful. There are currently many papers in this area, and everyone is focused on slightly improving FID and IS, but the visual quality is still much worse than the current FLUX. This leaves me with no motivation to decide whether to acc
1. The idea of combining Blur and Noise is impressive as it exploits spectral dependencies of images while preserving the data manifold. 2. The proposed BNMD framework is evaluated across several benchmarks, including CIFAR-10, FFHQ, and LSUN-church datasets, sufficiently verifying the effectiveness of the proposed method.
1. I think the motivation of the work is not very clearly presented in the introduction section. The exact necessity of introducing blurring into the diffusion model should be given, which is the most important motivation of this work. Figure 5 shows that a lower BNR brings better FID results, which somehow seems to say that “blur” does not help to improve the quality of the generated image. 2. For the sampling process presented in algorithm 2, the sampling starts from a zero-mean Gaussian dist
1. The authors analyze the characteristics and limitations of both hot and cold diffusion, leading to a unified diffusion architecture that combines both approaches. 2. The authors introduce the new concept of Blur-to-Noise Ratio (BNR), which enables better analysis of diffusion models. 3. Extensive quantitative and qualitative analyses, including comparisons with state-of-the-art methods and detailed data analysis, are provided. 4. Experimental results demonstrate the effectiveness of the propo
1. In Figure 1, the authors do not sufficiently explain the meaning of each module. For example, the significance of different-sized circles in the left chart is unclear, as well as the meaning of “Data manifold (indexed by noise level)” and the images depicted. It is suggested that the caption be revised to simplify understanding. 2. The authors use the improved DDPM++/NCSN++. However, it would be beneficial to experiment with the proposed approach on other baseline architectures, such as the o
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Image and Video Quality Assessment
