Diffusion Sampling with Momentum for Mitigating Divergence Artifacts
Suttisak Wizadwongsa, Worameth Chinchuthakun, Pramook Khungurn, Amit, Raj, Supasorn Suwajanakorn

TL;DR
This paper introduces momentum-based techniques to improve the stability and quality of diffusion sampling, significantly reducing artifacts and accelerating image generation.
Contribution
It proposes two novel momentum-enhanced methods that expand stability regions and balance accuracy with artifact suppression in diffusion sampling.
Findings
Reduced divergence artifacts in low-step sampling
Outperformed state-of-the-art diffusion solvers
Enhanced image quality in experiments
Abstract
Despite the remarkable success of diffusion models in image generation, slow sampling remains a persistent issue. To accelerate the sampling process, prior studies have reformulated diffusion sampling as an ODE/SDE and introduced higher-order numerical methods. However, these methods often produce divergence artifacts, especially with a low number of sampling steps, which limits the achievable acceleration. In this paper, we investigate the potential causes of these artifacts and suggest that the small stability regions of these methods could be the principal cause. To address this issue, we propose two novel techniques. The first technique involves the incorporation of Heavy Ball (HB) momentum, a well-known technique for improving optimization, into existing diffusion numerical methods to expand their stability regions. We also prove that the resulting methods have first-order…
Peer Reviews
Decision·ICLR 2024 poster
Authors' effort in experiments seem to be solid and thorough. Authors have also been patient to review basics of stability concept in numerical ODEs.
I recommend that authors add a paragraph explaining what "sampling" means in the context of diffusion in the appendix, so that the content can be more self-contained. From what I understand about the main text, authors mean generating/inferring an image with trained diffusion models. This is not equivalent to the meaning of illustrating the distribution of all potentially generated images given underlying diffusion models. I also suggest that authors make a table to list all used numerical for
1. The divergence artifacts problem is theoretically linked with the stability region of high-order numerical solvers for ODEs. The insight is very helpful for the design of diffusion sampling methods. 2. To enlarge the stability region, authors proposed Heavy Ball (HB) and generalized Heavy Ball (GHVB) as two solution without any training. Experiments show that the divergence artifacts are great mitigated in a low-step sampling case. 3. This paper is well organized and solid in theory.
1. The proposed method should be compared with the state-of-art methods in reducing divergence artifacts if it is a big challenge in diffusion models. 2. The stated motivation is diffusion model acceleration. Experiments are limited in comparing the results of few-step sampling, lacking clear numerical experiments in model acceleration. 3. It seems that the proposed methods show superior performance only in extremely low sampling steps. In the case of decent image quality, the improvement on sam
1), Both pixel-based and latent diffusion models are considered in this paper. 2), The presentation is overall easy to follow. 3), Good practical extension to DPM++ and PLMS. 4), The literature over existing high order ODE solvers seems up to date.
1), The technical novelty behind this work seems to be not significant. The main techniques used in this paper are directly borrowed from Polyak’s heavy ball (HB) momentum method, a conventional optimization algorithm. Besides, the main improvements of this work are built based on DPM++ and PLMS. 2), While two methods are proposed in the same paper, it is unclear which one should be used under what circumstances. The paper only gives some vague statements without comprehensive comparison. 3),
Code & Models
Videos
Taxonomy
TopicsModel Reduction and Neural Networks · Advanced Neuroimaging Techniques and Applications · Domain Adaptation and Few-Shot Learning
MethodsDiffusion
