Few-Shot Diffusion Models

Giorgio Giannone; Didrik Nielsen; Ole Winther

arXiv:2205.15463·cs.CV·June 1, 2022·20 cites

Few-Shot Diffusion Models

Giorgio Giannone, Didrik Nielsen, Ole Winther

PDF

Open Access 1 Repo

TL;DR

Few-Shot Diffusion Models (FSDM) enable high-quality image generation from new classes with only a few samples by leveraging conditional DDPMs and a set-based Vision Transformer for effective few-shot learning.

Contribution

The paper introduces FSDM, a novel framework that adapts diffusion models for few-shot generation using patch-based set conditioning with a Vision Transformer.

Findings

01

FSDM can generate diverse images from unseen classes with as few as 5 samples.

02

Conditioning on patch-based set information improves training convergence.

03

FSDM outperforms baseline diffusion models in few-shot learning benchmarks.

Abstract

Denoising diffusion probabilistic models (DDPM) are powerful hierarchical latent variable models with remarkable sample generation quality and training stability. These properties can be attributed to parameter sharing in the generative hierarchy, as well as a parameter-free diffusion-based inference procedure. In this paper, we present Few-Shot Diffusion Models (FSDM), a framework for few-shot generation leveraging conditional DDPMs. FSDMs are trained to adapt the generative process conditioned on a small set of images from a given class by aggregating image patch information using a set-based Vision Transformer (ViT). At test time, the model is able to generate samples from previously unseen classes conditioned on as few as 5 samples from that class. We empirically show that FSDM can perform few-shot generation and transfer to new datasets. We benchmark variants of our method on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

georgosgeorgos/few-shot-diffusion-models
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neuroimaging Techniques and Applications · Model Reduction and Neural Networks

MethodsAttention Is All You Need · Linear Layer · Diffusion · Softmax · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Multi-Head Attention · Absolute Position Encodings · Dropout · Adam