Few-Shot Diffusion Models
Giorgio Giannone, Didrik Nielsen, Ole Winther

TL;DR
Few-Shot Diffusion Models (FSDM) enable high-quality image generation from new classes with only a few samples by leveraging conditional DDPMs and a set-based Vision Transformer for effective few-shot learning.
Contribution
The paper introduces FSDM, a novel framework that adapts diffusion models for few-shot generation using patch-based set conditioning with a Vision Transformer.
Findings
FSDM can generate diverse images from unseen classes with as few as 5 samples.
Conditioning on patch-based set information improves training convergence.
FSDM outperforms baseline diffusion models in few-shot learning benchmarks.
Abstract
Denoising diffusion probabilistic models (DDPM) are powerful hierarchical latent variable models with remarkable sample generation quality and training stability. These properties can be attributed to parameter sharing in the generative hierarchy, as well as a parameter-free diffusion-based inference procedure. In this paper, we present Few-Shot Diffusion Models (FSDM), a framework for few-shot generation leveraging conditional DDPMs. FSDMs are trained to adapt the generative process conditioned on a small set of images from a given class by aggregating image patch information using a set-based Vision Transformer (ViT). At test time, the model is able to generate samples from previously unseen classes conditioned on as few as 5 samples from that class. We empirically show that FSDM can perform few-shot generation and transfer to new datasets. We benchmark variants of our method on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neuroimaging Techniques and Applications · Model Reduction and Neural Networks
MethodsAttention Is All You Need · Linear Layer · Diffusion · Softmax · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Multi-Head Attention · Absolute Position Encodings · Dropout · Adam
