Mind the Gap Between Synthetic and Real: Utilizing Transfer Learning to   Probe the Boundaries of Stable Diffusion Generated Data

Leonhard Hennicke; Christian Medeiros Adriano; Holger Giese; Jan; Mathias Koehler; Lukas Schott

arXiv:2405.03243·cs.CV·May 7, 2024

Mind the Gap Between Synthetic and Real: Utilizing Transfer Learning to Probe the Boundaries of Stable Diffusion Generated Data

Leonhard Hennicke, Christian Medeiros Adriano, Holger Giese, Jan, Mathias Koehler, Lukas Schott

PDF

Open Access

TL;DR

This paper investigates why models trained on synthetic data from Stable Diffusion underperform compared to real data, identifying the final layers as the main issue and proposing targeted fine-tuning to improve accuracy with less real data.

Contribution

It reveals that the accuracy gap mainly arises from the model's final layers and proposes a fine-tuning approach focused on these layers to reduce reliance on real data.

Findings

01

Final layers are the main source of performance drop.

02

Fine-tuning last layers with limited real data improves accuracy.

03

Synthetic data alone is insufficient to match real data performance.

Abstract

Generative foundation models like Stable Diffusion comprise a diverse spectrum of knowledge in computer vision with the potential for transfer learning, e.g., via generating data to train student models for downstream tasks. This could circumvent the necessity of collecting labeled real-world data, thereby presenting a form of data-free knowledge distillation. However, the resultant student models show a significant drop in accuracy compared to models trained on real data. We investigate possible causes for this drop and focus on the role of the different layers of the student model. By training these layers using either real or synthetic data, we reveal that the drop mainly stems from the model's final layers. Further, we briefly investigate other factors, such as differences in data-normalization between synthetic and real, the impact of data augmentations, texture vs.\ shape…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsDiffusion · Focus