Diffusion Self-Guidance for Controllable Image Generation
Dave Epstein, Allan Jabri, Ben Poole, Alexei A. Efros, Aleksander, Holynski

TL;DR
This paper introduces self-guidance, a novel method that enables precise control over diffusion-based image generation by leveraging internal model representations, allowing complex image manipulations without additional training.
Contribution
The authors propose a self-guidance technique that guides diffusion models using their internal signals, eliminating the need for extra models or training for controllable image synthesis.
Findings
Enables control over shape, location, and appearance of objects.
Allows complex image manipulations like merging and editing.
Works without additional training or external classifiers.
Abstract
Large-scale generative models are capable of producing high-quality images from detailed text descriptions. However, many aspects of an image are difficult or impossible to convey through text. We introduce self-guidance, a method that provides greater control over generated images by guiding the internal representations of diffusion models. We demonstrate that properties such as the shape, location, and appearance of objects can be extracted from these representations and used to steer sampling. Self-guidance works similarly to classifier guidance, but uses signals present in the pretrained model itself, requiring no additional models or training. We show how a simple set of properties can be composed to perform challenging image manipulations, such as modifying the position or size of objects, merging the appearance of objects in one image with the layout of another, composing objects…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMusic and Audio Processing · Image Retrieval and Classification Techniques · Generative Adversarial Networks and Image Synthesis
MethodsDiffusion
