Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and Relabeling
Xiao Cui, Yulei Qin, Xinyue Li, Wengang Zhou, Hongsheng Li, Houqiang Li

TL;DR
This paper proposes a novel framework for long-tailed dataset distillation that addresses bias and statistical estimation issues by using unbiased recovery and relabeling techniques, significantly improving performance on imbalanced datasets.
Contribution
The paper introduces a unified level framework with three components for unbiased recovery and relabeling, addressing limitations of trajectory-based methods in long-tailed dataset distillation.
Findings
Achieves 15.6% top-1 accuracy improvement on CIFAR-100-LT
Achieves 11.8% top-1 accuracy improvement on Tiny-ImageNet-LT
Consistently outperforms state-of-the-art methods across various class imbalance levels
Abstract
Dataset distillation creates a small distilled set that enables efficient training by capturing key information from the full dataset. While existing dataset distillation methods perform well on balanced datasets, they struggle under long-tailed distributions, where imbalanced class frequencies induce biased model representations and corrupt statistical estimates such as Batch Normalization (BN) statistics. In this paper, we rethink long-tailed dataset distillation by revisiting the limitations of trajectory-based methods, and instead adopt the statistical alignment perspective to jointly mitigate model bias and restore fair supervision. To this end, we introduce three dedicated components that enable unbiased recovery of distilled images and soft relabeling: (1) enhancing expert models (an observer model for recovery and a teacher model for relabeling) to enable reliable statistics…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications
