Dataset Distillation with Neural Characteristic Function: A Minmax Perspective
Shaobo Wang, Yicun Yang, Zhiyuan Liu, Chenghao Sun, Xuming, Hu, Conghui He, Linfeng Zhang

TL;DR
This paper introduces Neural Characteristic Function Discrepancy (NCFD), a new metric for dataset distillation that improves distribution matching accuracy and efficiency, enabling high-quality synthetic data generation with significantly reduced memory and computational costs.
Contribution
The paper proposes NCFD, a theoretically grounded distribution discrepancy measure using neural characteristic functions, and develops Neural Characteristic Function Matching (ymethod{}) for improved dataset distillation.
Findings
Achieves 20.5% accuracy boost on ImageSquawk.
Reduces GPU memory usage by over 300 times.
Faster processing speeds, enabling lossless CIFAR-100 compression.
Abstract
Dataset distillation has emerged as a powerful approach for reducing data requirements in deep learning. Among various methods, distribution matching-based approaches stand out for their balance of computational efficiency and strong performance. However, existing distance metrics used in distribution matching often fail to accurately capture distributional differences, leading to unreliable measures of discrepancy. In this paper, we reformulate dataset distillation as a minmax optimization problem and introduce Neural Characteristic Function Discrepancy (NCFD), a comprehensive and theoretically grounded metric for measuring distributional differences. NCFD leverages the Characteristic Function (CF) to encapsulate full distributional information, employing a neural network to optimize the sampling strategy for the CF's frequency arguments, thereby maximizing the discrepancy to enhance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
