Generative Models as a Data Source for Multiview Representation Learning
Ali Jahanian, Xavier Puig, Yonglong Tian, Phillip Isola

TL;DR
This paper explores using off-the-shelf generative models as a data source for learning visual representations, showing they can rival traditional dataset-based methods when proper sampling and training strategies are used.
Contribution
It demonstrates that generative models can serve as effective substitutes for datasets in multiview representation learning, with specific techniques to optimize performance.
Findings
Representations from generative models can outperform those from real data.
Contrastive learning benefits from latent space-based positive and negative pair sampling.
Careful sampling and training strategies are crucial for optimal results.
Abstract
Generative models are now capable of producing highly realistic images that look nearly indistinguishable from the data on which they are trained. This raises the question: if we have good enough generative models, do we still need datasets? We investigate this question in the setting of learning general-purpose visual representations from a black-box generative model rather than directly from data. Given an off-the-shelf image generator without any access to its training data, we train representations from the samples output by this generator. We compare several representation learning methods that can be applied to this setting, using the latent space of the generator to generate multiple "views" of the same semantic content. We show that for contrastive methods, this multiview data can naturally be used to identify positive pairs (nearby in latent space) and negative pairs (far apart…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
