Post Training Quantization for Efficient Dataset Condensation
Linh-Tam Tran, Sung-Ho Bae

TL;DR
This paper introduces a novel post-training quantization method for dataset condensation that significantly reduces storage size while maintaining high representation quality, especially effective at extremely low bit-widths.
Contribution
It proposes a patch-based quantization approach with clustering and a distribution alignment module, enabling efficient, low-bit dataset condensation without retraining.
Findings
Nearly doubles test accuracy at extreme compression regimes
Outperforms prior methods on CIFAR-10/100, Tiny ImageNet, and ImageNet subsets
Operates effectively at 2-bit images without additional distillation
Abstract
Dataset Condensation (DC) distills knowledge from large datasets into smaller ones, accelerating training and reducing storage requirements. However, despite notable progress, prior methods have largely overlooked the potential of quantization for further reducing storage costs. In this paper, we take the first step to explore post-training quantization in dataset condensation, demonstrating its effectiveness in reducing storage size while maintaining representation quality without requiring expensive training cost. However, we find that at extremely low bit-widths (e.g., 2-bit), conventional quantization leads to substantial degradation in representation quality, negatively impacting the networks trained on these data. To address this, we propose a novel \emph{patch-based post-training quantization} approach that ensures localized quantization with minimal loss of information. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
