Dataset Distillation via Vision-Language Category Prototype
Yawen Zou, Guang Li, Duo Su, Zi Wang, Jun Yu, Chao Zhang

TL;DR
This paper introduces a novel dataset distillation method that incorporates vision-language prototypes, using descriptive text generated by large language models to improve the semantic coherence and generalization of distilled datasets.
Contribution
The study proposes integrating text prototypes derived from large language models into dataset distillation, enhancing semantic understanding and broadening applicability beyond image-only approaches.
Findings
Achieves state-of-the-art validation performance
Generates logically coherent images with target objects
Demonstrates robust generalization across datasets
Abstract
Dataset distillation (DD) condenses large datasets into compact yet informative substitutes, preserving performance comparable to the original dataset while reducing storage, transmission costs, and computational consumption. However, previous DD methods mainly focus on distilling information from images, often overlooking the semantic information inherent in the data. The disregard for context hinders the model's generalization ability, particularly in tasks involving complex datasets, which may result in illogical outputs or the omission of critical objects. In this study, we integrate vision-language methods into DD by introducing text prototypes to distill language information and collaboratively synthesize data with image prototypes, thereby enhancing dataset distillation performance. Notably, the text prototypes utilized in this study are derived from descriptive text information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Advanced Neural Network Applications
MethodsFocus
