Simultaneous Image-to-Zero and Zero-to-Noise: Diffusion Models with   Analytical Image Attenuation

Yuhang Huang; Zheng Qin; Xinwang Liu; Kai Xu

arXiv:2306.13720·cs.CV·December 2, 2024·1 cites

Simultaneous Image-to-Zero and Zero-to-Noise: Diffusion Models with Analytical Image Attenuation

Yuhang Huang, Zheng Qin, Xinwang Liu, Kai Xu

PDF

Open Access 2 Repos

TL;DR

This paper introduces a novel diffusion model that incorporates an analytical image attenuation process, enabling high-quality image generation with significantly fewer denoising steps and faster sampling compared to traditional methods.

Contribution

The authors propose a new diffusion framework that models image-to-zero and zero-to-noise mappings simultaneously, simplifying training and allowing arbitrary step size sampling without ODE solvers.

Findings

01

Achieves comparable image quality with only 1/20 of the denoising steps.

02

Attains state-of-the-art results on image-conditioned tasks with minimal steps.

03

Significantly accelerates image generation speed while maintaining quality.

Abstract

Recent studies have demonstrated that the forward diffusion process is crucial for the effectiveness of diffusion models in terms of generative quality and sampling efficiency. We propose incorporating an analytical image attenuation process into the forward diffusion process for high-quality (un)conditioned image generation with significantly fewer denoising steps compared to the vanilla diffusion model requiring thousands of steps. In a nutshell, our method represents the forward image-to-noise mapping as simultaneous \textit{image-to-zero} mapping and \textit{zero-to-noise} mapping. Under this framework, we mathematically derive 1) the training objectives and 2) for the reverse time the sampling formula based on an analytical attenuation function which models image to zero mapping. The former enables our method to learn noise and image components simultaneously which simplifies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neuroimaging Techniques and Applications · Model Reduction and Neural Networks

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Diffusion