Unified Directly Denoising for Both Variance Preserving and Variance   Exploding Diffusion Models

Jingjing Wang; Dan Zhang; Feng Luo

arXiv:2405.21059·cs.CV·June 3, 2024

Unified Directly Denoising for Both Variance Preserving and Variance Exploding Diffusion Models

Jingjing Wang, Dan Zhang, Feng Luo

PDF

Open Access

TL;DR

This paper introduces a unified framework for directly denoising both Variance Preserving and Variance Exploding diffusion models, achieving state-of-the-art image generation quality with fewer steps and theoretical guarantees.

Contribution

The paper proposes uDDDM, a unified model for VP and VE diffusion, with theoretical proofs and an adaptive loss function, improving efficiency and performance.

Findings

01

uDDDM achieves competitive FID scores on CIFAR-10.

02

One-step generation with FID of 2.63 (VE) and 2.53 (VP).

03

Extended sampling reduces FID to 1.71 (VE) and 1.65 (VP).

Abstract

Previous work has demonstrated that, in the Variance Preserving (VP) scenario, the nascent Directly Denoising Diffusion Models (DDDM) can generate high-quality images in one step while achieving even better performance in multistep sampling. However, the Pseudo-LPIPS loss used in DDDM leads to concerns about the bias in assessment. Here, we propose a unified DDDM (uDDDM) framework that generates images in one-step/multiple steps for both Variance Preserving (VP) and Variance Exploding (VE) cases. We provide theoretical proofs of the existence and uniqueness of the model's solution paths, as well as the non-intersecting property of the sampling paths. Additionally, we propose an adaptive Pseudo-Huber loss function to balance the convergence to the true solution and the stability of convergence process.Through a comprehensive evaluation, we demonstrate that uDDDMs achieve FID scores…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Signal Denoising Methods

MethodsDiffusion