Return of Unconditional Generation: A Self-supervised Representation Generation Method
Tianhong Li, Dina Katabi, Kaiming He

TL;DR
This paper introduces Representation-Conditioned Generation (RCG), a self-supervised approach that significantly improves unconditional image generation quality by generating semantic representations without labels, achieving state-of-the-art results.
Contribution
The paper proposes RCG, a novel framework that uses self-supervised semantic representations to close the quality gap in unconditional generation without labels.
Findings
Achieves a new state-of-the-art FID of 2.15 on ImageNet 256x256.
Reduces previous best FID of 5.91 by 64%.
Unconditional generation quality rivals class-conditional methods.
Abstract
Unconditional generation -- the problem of modeling data distribution without relying on human-annotated labels -- is a long-standing and fundamental challenge in generative models, creating a potential of learning from large-scale unlabeled data. In the literature, the generation quality of an unconditional method has been much worse than that of its conditional counterpart. This gap can be attributed to the lack of semantic information provided by labels. In this work, we show that one can close this gap by generating semantic representations in the representation space produced by a self-supervised encoder. These representations can be used to condition the image generator. This framework, called Representation-Conditioned Generation (RCG), provides an effective solution to the unconditional generation problem without using labels. Through comprehensive experiments, we observe that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image and Video Retrieval Techniques · Video Analysis and Summarization
MethodsDiffusion
