Tiny models from tiny data: Textual and null-text inversion for few-shot   distillation

Erik Landolsi; Fredrik Kahl

arXiv:2406.03146·cs.CV·March 21, 2025

Tiny models from tiny data: Textual and null-text inversion for few-shot distillation

Erik Landolsi, Fredrik Kahl

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel diffusion model inversion technique called TINT for few-shot distillation, enabling small models to achieve high accuracy efficiently using synthetic data, with theoretical analysis and practical benchmarks.

Contribution

The paper presents TINT, a new diffusion inversion method that improves few-shot distillation by generating diverse and specific synthetic data, achieving state-of-the-art results with reduced computational cost.

Findings

01

TINT outperforms previous methods in small model accuracy on benchmarks.

02

Theoretical analysis links estimator variance to episode and query count.

03

Synthetic data from TINT surpasses real data in training effectiveness.

Abstract

Few-shot learning deals with problems such as image classification using very few training examples. Recent vision foundation models show excellent few-shot transfer abilities, but are large and slow at inference. Using knowledge distillation, the capabilities of high-performing but slow models can be transferred to tiny, efficient models. However, common distillation methods require a large set of unlabeled data, which is not available in the few-shot setting. To overcome this lack of data, there has been a recent interest in using synthetic data. We expand on this line of research by presenting a novel diffusion model inversion technique (TINT) combining the diversity of textual inversion with the specificity of null-text inversion. Using this method in a few-shot distillation pipeline leads to state-of-the-art accuracy among small student models on popular benchmarks, while being…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pixwse/tiny2
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Generative Adversarial Networks and Image Synthesis

MethodsSparse Evolutionary Training · Diffusion