Improving Denoising Diffusion Models via Simultaneous Estimation of Image and Noise
Zhenkai Zhang, Krista A. Ehinger, Tom Drummond

TL;DR
This paper proposes a reparameterization of diffusion processes and simultaneous estimation of image and noise to enhance the speed and quality of image generation in diffusion models.
Contribution
It introduces a novel angle-based reparameterization and a dual estimation network, enabling faster and higher-quality image synthesis with improved stability.
Findings
Faster image generation with higher quality metrics.
Elimination of singularities in the diffusion process.
Effective use of higher order ODE solvers like Runge-Kutta.
Abstract
This paper introduces two key contributions aimed at improving the speed and quality of images generated through inverse diffusion processes. The first contribution involves reparameterizing the diffusion process in terms of the angle on a quarter-circular arc between the image and noise, specifically setting the conventional . This reparameterization eliminates two singularities and allows for the expression of diffusion evolution as a well-behaved ordinary differential equation (ODE). In turn, this allows higher order ODE solvers such as Runge-Kutta methods to be used effectively. The second contribution is to directly estimate both the image () and noise () using our network, which enables more stable calculations of the update step in the inverse diffusion steps, as accurate estimation of both the image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neuroimaging Techniques and Applications · Generative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks
MethodsDiffusion · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
