Simultaneous Image-to-Zero and Zero-to-Noise: Diffusion Models with Analytical Image Attenuation
Yuhang Huang, Zheng Qin, Xinwang Liu, Kai Xu

TL;DR
This paper introduces a novel diffusion model that incorporates an analytical image attenuation process, enabling high-quality image generation with significantly fewer denoising steps and faster sampling compared to traditional methods.
Contribution
The authors propose a new diffusion framework that models image-to-zero and zero-to-noise mappings simultaneously, simplifying training and allowing arbitrary step size sampling without ODE solvers.
Findings
Achieves comparable image quality with only 1/20 of the denoising steps.
Attains state-of-the-art results on image-conditioned tasks with minimal steps.
Significantly accelerates image generation speed while maintaining quality.
Abstract
Recent studies have demonstrated that the forward diffusion process is crucial for the effectiveness of diffusion models in terms of generative quality and sampling efficiency. We propose incorporating an analytical image attenuation process into the forward diffusion process for high-quality (un)conditioned image generation with significantly fewer denoising steps compared to the vanilla diffusion model requiring thousands of steps. In a nutshell, our method represents the forward image-to-noise mapping as simultaneous \textit{image-to-zero} mapping and \textit{zero-to-noise} mapping. Under this framework, we mathematically derive 1) the training objectives and 2) for the reverse time the sampling formula based on an analytical attenuation function which models image to zero mapping. The former enables our method to learn noise and image components simultaneously which simplifies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neuroimaging Techniques and Applications · Model Reduction and Neural Networks
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Diffusion
