Hyperbolic Dataset Distillation
Wenyuan Li, Guang Li, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama

TL;DR
This paper introduces HDD, a hyperbolic dataset distillation method that models hierarchical data structures more effectively than Euclidean approaches, leading to more compact synthetic datasets and improved training stability.
Contribution
HDD is the first to incorporate hyperbolic space into dataset distillation, explicitly modeling hierarchical relationships to enhance synthetic data quality and efficiency.
Findings
Hyperbolic space embedding captures hierarchical data structures effectively.
Pruning in hyperbolic space retains 80% of data with minimal performance loss.
Hyperbolic distillation improves training stability and data compactness.
Abstract
To address the computational and storage challenges posed by large-scale datasets in deep learning, dataset distillation has been proposed to synthesize a compact dataset that replaces the original while maintaining comparable model performance. Unlike optimization-based approaches that require costly bi-level optimization, distribution matching (DM) methods improve efficiency by aligning the distributions of synthetic and original data, thereby eliminating nested optimization. DM achieves high computational efficiency and has emerged as a promising solution. However, existing DM methods, constrained to Euclidean space, treat data as independent and identically distributed points, overlooking complex geometric and hierarchical relationships. To overcome this limitation, we propose a novel hyperbolic dataset distillation method, termed HDD. Hyperbolic space, characterized by negative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Computational Physics and Python Applications
MethodsPruning · Sparse Evolutionary Training
