DataDream: Few-shot Guided Dataset Generation
Jae Myung Kim, Jessica Bader, Stephan Alaniz, Cordelia Schmid, Zeynep, Akata

TL;DR
DataDream is a novel framework that synthesizes high-quality, in-distribution training datasets from few-shot examples, significantly improving downstream image classification accuracy across multiple datasets.
Contribution
It introduces a method to fine-tune diffusion models with LoRA on few-shot data, enabling more faithful synthetic dataset generation for classification tasks.
Findings
Outperforms previous methods on 7 out of 10 datasets in classification accuracy
Generates more in-distribution and fine-grained images for training
Provides insights into factors affecting synthetic data quality and model performance
Abstract
While text-to-image diffusion models have been shown to achieve state-of-the-art results in image synthesis, they have yet to prove their effectiveness in downstream applications. Previous work has proposed to generate data for image classifier training given limited real data access. However, these methods struggle to generate in-distribution images or depict fine-grained features, thereby hindering the generalization of classification models trained on synthetic datasets. We propose DataDream, a framework for synthesizing classification datasets that more faithfully represents the real data distribution when guided by few-shot examples of the target classes. DataDream fine-tunes LoRA weights for the image generation model on the few real images before generating the training data using the adapted model. We then fine-tune LoRA weights for CLIP using the synthetic data to improve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification
MethodsContrastive Language-Image Pre-training · Diffusion
