Dreamguider: Improved Training free Diffusion-based Conditional   Generation

Nithin Gopalakrishnan Nair; Vishal M Patel

arXiv:2406.02549·cs.CV·June 5, 2024

Dreamguider: Improved Training free Diffusion-based Conditional Generation

Nithin Gopalakrishnan Nair, Vishal M Patel

PDF

Open Access

TL;DR

Dreamguider introduces a lightweight, compute-efficient method for inference-time guidance in diffusion models, eliminating the need for backpropagation and manual parameter tuning across various tasks.

Contribution

It proposes a novel gradient regulation technique and guidance scale that work universally, along with an augmentation strategy, advancing inference guidance without heavy computation.

Findings

01

Effective guidance across multiple datasets and models

02

Elimination of backpropagation during inference

03

Significant performance boost with augmentation

Abstract

Diffusion models have emerged as a formidable tool for training-free conditional generation.However, a key hurdle in inference-time guidance techniques is the need for compute-heavy backpropagation through the diffusion network for estimating the guidance direction. Moreover, these techniques often require handcrafted parameter tuning on a case-by-case basis. Although some recent works have introduced minimal compute methods for linear inverse problems, a generic lightweight guidance solution to both linear and non-linear guidance problems is still missing. To this end, we propose Dreamguider, a method that enables inference-time guidance without compute-heavy backpropagation through the diffusion network. The key idea is to regulate the gradient flow through a time-varying factor. Moreover, we propose an empirical guidance scale that works for a wide variety of tasks, hence removing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies

MethodsDiffusion