GIFT: Unlocking Full Potential of Labels in Distilled Dataset at   Near-zero Cost

Xinyi Shang; Peng Sun; Tao Lin

arXiv:2405.14736·cs.CV·March 7, 2025

GIFT: Unlocking Full Potential of Labels in Distilled Dataset at Near-zero Cost

Xinyi Shang, Peng Sun, Tao Lin

PDF

Open Access 1 Repo

TL;DR

This paper introduces GIFT, a simple yet effective method to fully utilize labels in dataset distillation, significantly improving performance and generalization across datasets without extra costs.

Contribution

GIFT provides a universal loss function and label refinement technique that enhances dataset distillation effectiveness and cross-optimizer generalization.

Findings

01

GIFT improves state-of-the-art results across datasets.

02

GIFT enhances cross-optimizer generalization notably.

03

GIFT achieves significant performance gains on ImageNet-1K.

Abstract

Recent advancements in dataset distillation have demonstrated the significant benefits of employing soft labels generated by pre-trained teacher models. In this paper, we introduce a novel perspective by emphasizing the full utilization of labels. We first conduct a comprehensive comparison of various loss functions for soft label utilization in dataset distillation, revealing that the model trained on the synthetic dataset exhibits high sensitivity to the choice of loss function for soft label utilization. This finding highlights the necessity of a universal loss function for training models on synthetic datasets. Building on these insights, we introduce an extremely simple yet surprisingly effective plug-and-play approach, GIFT, which encompasses soft label refinement and a cosine similarity-based loss function to efficiently leverage full label information. Extensive experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lins-lab/gift
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational Drug Discovery Methods · Machine Learning and Data Classification