Dataset Distillation for Pre-Trained Self-Supervised Vision Models

George Cazenavette; Antonio Torralba; Vincent Sitzmann

arXiv:2511.16674 · cs.CV · November 21, 2025

Open Access

TL;DR

This paper introduces Linear Gradient Matching, a dataset distillation method that synthesizes images for training linear probes on top of large pre-trained vision models. The resulting synthetic datasets outperform real-image baselines and generalize across pre-trained models.

Contribution

The paper proposes a dataset distillation approach tailored to pre-trained self-supervised vision models. It targets the training of linear classifiers on top of those models and aims for synthetic data that transfers across models.

Findings

01. Synthetic datasets outperform real-image baselines.

02. The method generalizes across different pre-trained models.

03. The approach is effective for fine-grained classification and for interpretability.

Abstract

The task of dataset distillation aims to find a small set of synthetic images such that training a model on them reproduces the performance of the same model trained on a much larger dataset of real samples. Existing distillation methods focus on synthesizing datasets that enable training randomly initialized models. In contrast, state-of-the-art vision approaches are increasingly building on large, pre-trained self-supervised models rather than training from scratch. In this paper, we investigate the problem of distilling datasets that enable us to optimally train linear probes on top of such large, pre-trained vision models. We introduce a method of dataset distillation for this task called Linear Gradient Matching that optimizes the synthetic images such that, when passed through a pre-trained feature extractor, they induce gradients in the linear classifier similar to those produced…
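The gradient-matching idea in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation. It assumes the comparison target is the gradient produced by real data (the abstract is cut off before saying so), uses cosine distance as the similarity measure, and replaces the pre-trained feature extractor with a hypothetical fixed random projection called `extract`. All names, shapes, and data here are illustrative.

```python
import numpy as np

def softmax(z):
    # Numerically stable row-wise softmax.
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def linear_probe_grad(features, labels, W):
    # Gradient of the mean cross-entropy loss with respect to the
    # linear classifier weights W (shape: feature_dim x num_classes).
    n, num_classes = features.shape[0], W.shape[1]
    probs = softmax(features @ W)
    onehot = np.eye(num_classes)[labels]
    return features.T @ (probs - onehot) / n

def gradient_matching_loss(g_real, g_syn, eps=1e-12):
    # Assumption: cosine distance between flattened gradients.
    a, b = g_real.ravel(), g_syn.ravel()
    cos = a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + eps)
    return 1.0 - cos

rng = np.random.default_rng(0)
d_in, d_feat, num_classes = 32, 16, 4

# Hypothetical stand-in for a frozen pre-trained feature extractor.
P = rng.normal(size=(d_in, d_feat))

def extract(x):
    return np.tanh(x @ P)

# Toy "real" batch and a tiny synthetic set (one sample per class).
x_real = rng.normal(size=(256, d_in))
y_real = rng.integers(0, num_classes, size=256)
x_syn = rng.normal(size=(num_classes, d_in))
y_syn = np.arange(num_classes)

# Randomly initialized linear probe.
W = rng.normal(size=(d_feat, num_classes)) * 0.01

g_real = linear_probe_grad(extract(x_real), y_real, W)
g_syn = linear_probe_grad(extract(x_syn), y_syn, W)

loss = gradient_matching_loss(g_real, g_syn)
loss_same = gradient_matching_loss(g_real, g_real)
```

In a full distillation loop, `loss` would be differentiated with respect to `x_syn` through the frozen extractor (for example with an autodiff framework) and the synthetic images updated to reduce it. That step is omitted here to keep the sketch dependency-light.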

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Explainable Artificial Intelligence (XAI)