FAST: Topology-Aware Frequency-Domain Distribution Matching for Coreset Selection
Jin Cui, Boran Zhao, Jiajun Xu, Jiaqi Guo, Shuo Guan, Pengju Ren

TL;DR
FAST introduces a novel DNN-free, spectral graph theory-based coreset selection method that effectively matches data distributions in the frequency domain, leading to significant improvements in accuracy, efficiency, and energy savings.
Contribution
It is the first to formulate coreset selection as a spectral graph optimization problem using characteristic function distance, addressing distributional constraints without relying on DNNs.
Findings
Outperforms state-of-the-art methods with 9.12% accuracy gain
Reduces power consumption by 96.57%
Achieves 2.2x speedup in training
Abstract
Coreset selection compresses large datasets into compact, representative subsets, reducing the energy and computational burden of training deep neural networks. Existing methods are either: (i) DNN-based, which are tied to model-specific parameters and introduce architectural bias; or (ii) DNN-free, which rely on heuristics lacking theoretical guarantees. Neither approach explicitly constrains distributional equivalence, largely because continuous distribution matching is considered inapplicable to discrete sampling. Moreover, prevalent metrics (e.g., MSE, KL, CE, MMD) cannot accurately capture higher-order moment discrepancies, leading to suboptimal coresets. In this work, we propose FAST, the first DNN-free distribution-matching coreset selection framework that formulates the coreset selection task as a graph-constrained optimization problem grounded in spectral graph theory and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Advanced Graph Neural Networks
