Spectral Gradient Surgery for Domain-Generalizable Dataset Distillation
Minyoung Oh, Najeong Chae, and Jae-Young Sim

TL;DR
This paper introduces DGDD, a new dataset distillation setting focused on out-of-distribution generalization, and proposes Spectral Gradient Surgery to improve the robustness of synthetic datasets against domain shifts.
Contribution
It formulates a novel DG setting for dataset distillation and proposes SGS, a spectral domain technique to disentangle class and domain information, enhancing OOD generalization.
Findings
SGS significantly improves OOD generalization in dataset distillation.
SGS is compatible with existing distribution matching methods.
Experiments show substantial performance gains across benchmarks.
Abstract
Dataset Distillation (DD) synthesizes a compact synthetic dataset that preserves the training utility of a full dataset. However, its standard formulation assumes that test data follow the same distribution as training data, an assumption that rarely holds in practice. A straightforward extension-applying post-hoc Domain Generalization (DG) techniques to distilled data-is ill-suited because existing DG methods rely on the natural diversity of real datasets, which compact synthetic sets inherently lack, while also incurring substantial augmentation overhead that conflicts with the efficiency objective of dataset distillation. To address this limitation, we introduce Domain Generalizable Dataset Distillation (DGDD), a new problem setting that explicitly targets out-of-distribution (OOD) generalization of distilled datasets. We study this problem through a widely adopted DD baseline of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
