Rethinking Oversaturation in Classifier-Free Guidance via Low Frequency
Kaiyu Song, Hanjiang Lai

TL;DR
This paper presents LF-CFG, a novel method that reduces oversaturation in classifier-free guidance by adaptively down-weighting redundant low-frequency signals, improving image quality in diffusion models.
Contribution
It introduces a low-frequency perspective and an adaptive threshold-based approach to mitigate oversaturation in classifier-free guidance for diffusion models.
Findings
LF-CFG effectively reduces oversaturation and artifacts.
It improves image quality across multiple diffusion models.
The method is adaptable to various model versions.
Abstract
Classifier-free guidance (CFG) succeeds in condition diffusion models that use a guidance scale to balance the influence of conditional and unconditional terms. A high guidance scale is used to enhance the performance of the conditional term. However, the high guidance scale often results in oversaturation and unrealistic artifacts. In this paper, we introduce a new perspective based on low-frequency signals, identifying the accumulation of redundant information in these signals as the key factor behind oversaturation and unrealistic artifacts. Building on this insight, we propose low-frequency improved classifier-free guidance (LF-CFG) to mitigate these issues. Specifically, we introduce an adaptive threshold-based measurement to pinpoint the locations of redundant information. We determine a reasonable threshold by analyzing the change rate of low-frequency information between prior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTarget Tracking and Data Fusion in Sensor Networks · Guidance and Control Systems
MethodsDiffusion
