Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios
Kai Wang, Zekai Li, Zhi-Qi Cheng, Samir Khaki, Ahmad Sajedi,, Ramakrishna Vedantam, Konstantinos N Plataniotis, Alexander Hauptmann, Yang, You

TL;DR
This paper introduces EDF, a dataset distillation method that emphasizes discriminative features using Grad-CAM, significantly improving performance in complex scenarios like ImageNet-1K subsets.
Contribution
EDF is a novel dataset distillation approach that enhances key discriminative regions in synthetic images and introduces the Comp-DD benchmark for complex scenarios.
Findings
EDF outperforms state-of-the-art methods in complex scenarios
The approach effectively emphasizes discriminative features using Grad-CAM
The new benchmark facilitates research in complex dataset distillation
Abstract
Dataset distillation has demonstrated strong performance on simple datasets like CIFAR, MNIST, and TinyImageNet but struggles to achieve similar results in more complex scenarios. In this paper, we propose EDF (emphasizes the discriminative features), a dataset distillation method that enhances key discriminative regions in synthetic images using Grad-CAM activation maps. Our approach is inspired by a key observation: in simple datasets, high-activation areas typically occupy most of the image, whereas in complex scenarios, the size of these areas is much smaller. Unlike previous methods that treat all pixels equally when synthesizing images, EDF uses Grad-CAM activation maps to enhance high-activation areas. From a supervision perspective, we downplay supervision signals that have lower losses, as they contain common patterns. Additionally, to help the DD community better explore…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Processing Techniques · Data Stream Mining Techniques · Fuzzy Logic and Control Systems
