Bridging the Gap: Addressing Discrepancies in Diffusion Model Training   for Classifier-Free Guidance

Niket Patel; Luis Salamanca; Luis Barba

arXiv:2311.00938·cs.LG·November 3, 2023·1 cites

Bridging the Gap: Addressing Discrepancies in Diffusion Model Training for Classifier-Free Guidance

Niket Patel, Luis Salamanca, Luis Barba

PDF

Open Access

TL;DR

This paper identifies a discrepancy in diffusion model training for classifier-free guidance and proposes a new loss function to improve sample quality, robustness, and efficiency, validated on CIFAR-10 and preliminary results on Stable Diffusion.

Contribution

The paper introduces an improved loss function that aligns training with sampling behavior, enhancing diffusion model performance and robustness.

Findings

01

Higher quality samples with fewer timesteps

02

Increased robustness to guidance scale variations

03

Preliminary benefits observed on large diffusion models

Abstract

Diffusion models have emerged as a pivotal advancement in generative models, setting new standards to the quality of the generated instances. In the current paper we aim to underscore a discrepancy between conventional training methods and the desired conditional sampling behavior of these models. While the prevalent classifier-free guidance technique works well, it's not without flaws. At higher values for the guidance scale parameter $w$ , we often get out of distribution samples and mode collapse, whereas at lower values for $w$ we may not get the desired specificity. To address these challenges, we introduce an updated loss function that better aligns training objectives with sampling behaviors. Experimental validation with FID scores on CIFAR-10 elucidates our method's ability to produce higher quality samples with fewer sampling timesteps, and be more robust to the choice of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Topic Modeling

MethodsDiffusion