Dataset Distillation via Factorization
Songhua Liu, Kai Wang, Xingyi Yang, Jingwen Ye, Xinchao Wang

TL;DR
This paper introduces HaBa, a novel dataset distillation method that decomposes datasets into hallucination networks and bases, significantly improving data efficiency and classification accuracy while reducing parameters.
Contribution
The paper proposes a new dataset factorization approach, HaBa, which enhances dataset distillation by decomposing data into components, increasing informativeness and diversity, and improving downstream task performance.
Findings
Achieves up to 65% reduction in compressed parameters.
Improves classification accuracy by approximately 10% in cross-architecture tests.
Demonstrates significant performance gains over previous state-of-the-art methods.
Abstract
In this paper, we study \xw{dataset distillation (DD)}, from a novel perspective and introduce a \emph{dataset factorization} approach, termed \emph{HaBa}, which is a plug-and-play strategy portable to any existing DD baseline. Unlike conventional DD approaches that aim to produce distilled and representative samples, \emph{HaBa} explores decomposing a dataset into two components: data \emph{Ha}llucination networks and \emph{Ba}ses, where the latter is fed into the former to reconstruct image samples. The flexible combinations between bases and hallucination networks, therefore, equip the distilled data with exponential informativeness gain, which largely increase the representation capability of distilled datasets. To furthermore increase the data efficiency of compression results, we further introduce a pair of adversarial contrastive constraints on the resultant hallucination…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Advanced Image Processing Techniques · Digital Media Forensic Detection
