DataDream: Few-shot Guided Dataset Generation

Jae Myung Kim; Jessica Bader; Stephan Alaniz; Cordelia Schmid; Zeynep; Akata

arXiv:2407.10910·cs.CV·July 17, 2024

DataDream: Few-shot Guided Dataset Generation

Jae Myung Kim, Jessica Bader, Stephan Alaniz, Cordelia Schmid, Zeynep, Akata

PDF

Open Access 2 Repos

TL;DR

DataDream is a novel framework that synthesizes high-quality, in-distribution training datasets from few-shot examples, significantly improving downstream image classification accuracy across multiple datasets.

Contribution

It introduces a method to fine-tune diffusion models with LoRA on few-shot data, enabling more faithful synthetic dataset generation for classification tasks.

Findings

01

Outperforms previous methods on 7 out of 10 datasets in classification accuracy

02

Generates more in-distribution and fine-grained images for training

03

Provides insights into factors affecting synthetic data quality and model performance

Abstract

While text-to-image diffusion models have been shown to achieve state-of-the-art results in image synthesis, they have yet to prove their effectiveness in downstream applications. Previous work has proposed to generate data for image classifier training given limited real data access. However, these methods struggle to generate in-distribution images or depict fine-grained features, thereby hindering the generalization of classification models trained on synthetic datasets. We propose DataDream, a framework for synthesizing classification datasets that more faithfully represents the real data distribution when guided by few-shot examples of the target classes. DataDream fine-tunes LoRA weights for the image generation model on the few real images before generating the training data using the adapted model. We then fine-tune LoRA weights for CLIP using the synthetic data to improve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification

MethodsContrastive Language-Image Pre-training · Diffusion