KNN-Diffusion: Image Generation via Large-Scale Retrieval
Shelly Sheynin, Oron Ashual, Adam Polyak, Uriel Singer, Oran Gafni,, Eliya Nachmani, Yaniv Taigman

TL;DR
KNN-Diffusion introduces a retrieval-based approach enabling efficient, text-free image generation and manipulation by leveraging large-scale kNN retrieval with diffusion models, outperforming existing methods on various datasets.
Contribution
The paper presents a novel retrieval-based framework that allows training small diffusion models without text data and enables flexible image generation and editing.
Findings
Achieves state-of-the-art results in text-free image generation.
Enables out-of-distribution image synthesis by database swapping.
Supports local semantic manipulations while maintaining object identity.
Abstract
Recent text-to-image models have achieved impressive results. However, since they require large-scale datasets of text-image pairs, it is impractical to train them on new domains where data is scarce or not labeled. In this work, we propose using large-scale retrieval methods, in particular, efficient k-Nearest-Neighbors (kNN), which offers novel capabilities: (1) training a substantially small and efficient text-to-image diffusion model without any text, (2) generating out-of-distribution images by simply swapping the retrieval database at inference time, and (3) performing text-driven local semantic manipulations while preserving object identity. To demonstrate the robustness of our method, we apply our kNN approach on two state-of-the-art diffusion backbones, and show results on several different datasets. As evaluated by human studies and automatic metrics, our method achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
MethodsDiffusion
