FADRM: Fast and Accurate Data Residual Matching for Dataset Distillation
Jiacheng Cui, Xinyue Bi, Yaxin Luo, Xiaohan Zhao, Jiacheng Liu, Zhiqiang Shen

TL;DR
FADRM introduces a novel data residual matching technique for dataset distillation that leverages data-level skip connections, significantly improving efficiency and performance over existing methods.
Contribution
The paper pioneers Data Residual Matching at the data level, combining pixel space optimization with data residuals to enhance dataset distillation.
Findings
Achieves 47.7% test accuracy on ImageNet-1K with ResNet-18 at 0.8% compression.
Reduces training time and GPU memory usage by 50%.
Outperforms state-of-the-art methods like RDED, EDC, and CV-DD.
Abstract
Residual connection has been extensively studied and widely applied at the model architecture level. However, its potential in the more challenging data-centric approaches remains unexplored. In this work, we introduce the concept of Data Residual Matching for the first time, leveraging data-level skip connections to facilitate data generation and mitigate data information vanishing. This approach maintains a balance between newly acquired knowledge through pixel space optimization and existing core local information identification within raw data modalities, specifically for the dataset distillation task. Furthermore, by incorporating optimization-level refinements, our method significantly improves computational efficiency, achieving superior performance while reducing training time and peak GPU memory usage by 50%. Consequently, the proposed method Fast and Accurate Data Residual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Generative Adversarial Networks and Image Synthesis
