Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
Nithin Gopalakrishnan Nair, Anoop Cherian, Suhas Lohit, Ye Wang,, Toshiaki Koike-Akino, Vishal M. Patel, Tim K. Marks

TL;DR
Steered Diffusion introduces a flexible framework that guides unconditional diffusion models at inference time for diverse zero-shot conditional image synthesis tasks, achieving high-quality results without retraining.
Contribution
It proposes a novel method to steer diffusion models using a pre-trained inverse model, enabling multi-condition control during inference for various image editing tasks.
Findings
Outperforms state-of-the-art diffusion plug-and-play models
Achieves high-quality inpainting, colorization, and editing results
Adds negligible computational overhead
Abstract
Conditional generative models typically demand large annotated training sets to achieve high-quality synthesis. As a result, there has been significant interest in designing models that perform plug-and-play generation, i.e., to use a predefined or pretrained model, which is not explicitly trained on the generative task, to guide the generative process (e.g., using language). However, such guidance is typically useful only towards synthesizing high-level semantics rather than editing fine-grained details as in image-to-image translation tasks. To this end, and capitalizing on the powerful fine-grained generative control offered by the recent diffusion-based generative models, we introduce Steered Diffusion, a generalized framework for photorealistic zero-shot conditional image generation using a diffusion model trained for unconditional generation. The key idea is to steer the image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis
MethodsDiffusion
