Accelerating Large-Scale Dataset Distillation via Exploration-Exploitation Optimization
Muhammad J. Alahmadi, Peng Gao, Feiyi Wang, Dongkuan Xu

TL;DR
This paper introduces E$^2$D, an exploration-exploitation optimization method for large-scale dataset distillation that significantly improves efficiency and accuracy, enabling faster training with smaller datasets.
Contribution
The paper proposes a novel two-phase optimization strategy for dataset distillation that balances accuracy and efficiency, outperforming existing methods on large-scale benchmarks.
Findings
E$^2$D achieves state-of-the-art accuracy on ImageNet-1K.
E$^2$D is 18 times faster than previous methods on ImageNet-1K.
E$^2$D improves accuracy and speed on ImageNet-21K.
Abstract
Dataset distillation compresses the original data into compact synthetic datasets, reducing training time and storage while retaining model performance, enabling deployment under limited resources. Although recent decoupling-based distillation methods enable dataset distillation at large scale, they continue to face an efficiency gap: optimization-based decoupling methods achieve higher accuracy but demand intensive computation, whereas optimization-free decoupling methods are efficient but sacrifice accuracy. To overcome this trade-off, we propose Exploration--Exploitation Distillation (ED), a simple, practical method that minimizes redundant computation through an efficient pipeline that begins with full-image initialization to preserve semantic integrity and feature diversity. It then uses a two-phase optimization strategy: an exploration phase that performs uniform updates and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis
