Frequency Domain-based Dataset Distillation
Donghyeok Shin, Seungjae Shin, Il-Chul Moon

TL;DR
FreD introduces a frequency domain-based approach for dataset distillation, efficiently creating small synthetic datasets by selecting key frequency components, outperforming traditional spatial domain methods.
Contribution
It proposes a novel frequency domain parameterization method for dataset distillation that improves efficiency and performance over existing spatial domain approaches.
Findings
FreD reduces the synthesis budget significantly.
It preserves dataset information better than conventional methods.
FreD enhances existing distillation methods' performance.
Abstract
This paper presents FreD, a novel parameterization method for dataset distillation, which utilizes the frequency domain to distill a small-sized synthetic dataset from a large-sized original dataset. Unlike conventional approaches that focus on the spatial domain, FreD employs frequency-based transforms to optimize the frequency representations of each data instance. By leveraging the concentration of spatial domain information on specific frequency components, FreD intelligently selects a subset of frequency dimensions for optimization, leading to a significant reduction in the required budget for synthesizing an instance. Through the selection of frequency dimensions based on the explained variance, FreD demonstrates both theoretical and empirical evidence of its ability to operate efficiently within a limited budget, while better preserving the information of the original dataset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsCancer-related molecular mechanisms research
MethodsFocus
