Self-Guided Diffusion Models
Vincent Tao Hu, David W Zhang, Yuki M. Asano, Gertjan J. Burghouts,, Cees G. M. Snoek

TL;DR
This paper introduces a self-guided diffusion model that eliminates the need for annotated data by using self-supervision signals, enabling flexible, diverse, and semantically consistent image generation without manual labels.
Contribution
It proposes a novel self-guided diffusion framework leveraging self-supervision, outperforming traditional guidance methods and removing dependence on annotated datasets.
Findings
Self-guided diffusion outperforms unguided models.
Self-supervised guidance can surpass ground-truth label guidance.
Method generates diverse, semantically consistent images without annotations.
Abstract
Diffusion models have demonstrated remarkable progress in image generation quality, especially when guidance is used to control the generative process. However, guidance requires a large amount of image-annotation pairs for training and is thus dependent on their availability, correctness and unbiasedness. In this paper, we eliminate the need for such annotation by instead leveraging the flexibility of self-supervision signals to design a framework for self-guided diffusion models. By leveraging a feature extraction function and a self-annotation function, our method provides guidance signals at various image granularities: from the level of holistic images to object boxes and even segmentation masks. Our experiments on single-label and multi-label image datasets demonstrate that self-labeled guidance always outperforms diffusion models without guidance and may even surpass guidance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
MethodsDiffusion
