D$^4$M: Dataset Distillation via Disentangled Diffusion Model

Duo Su; Junjie Hou; Weizhi Gao; Yingjie Tian; Bowen Tang

arXiv:2407.15138 · cs.CV · July 23, 2024

TL;DR

D$^4$M is an architecture-independent dataset distillation method built on a disentangled diffusion model. It aims for better cross-architecture generalization and lower computational cost than existing approaches.

Contribution

The paper proposes a novel, architecture-independent dataset distillation framework leveraging latent diffusion models and label-informed prototypes, improving cross-architecture performance.
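This page does not spell out how the prototypes are computed, so the following is only a minimal sketch of one plausible reading: treat each class's latent codes as points and take per-class k-means centroids as prototypes. The function name `class_prototypes`, its parameters, and the random arrays standing in for latent-encoder outputs are all assumptions made for illustration; none of them come from the paper or its repository.

```python
import numpy as np

def class_prototypes(latents, labels, per_class, iters=10, seed=0):
    """Return {label: (per_class, dim) array of k-means centroids}.

    Hypothetical helper: assumes prototypes are per-class cluster centres
    in latent space, which the source page does not confirm.
    """
    rng = np.random.default_rng(seed)
    prototypes = {}
    for label in np.unique(labels):
        points = latents[labels == label]
        # Initialise centroids from distinct points of this class.
        start = rng.choice(len(points), size=per_class, replace=False)
        centroids = points[start].copy()
        for _ in range(iters):
            # Assign every point to its nearest centroid (squared Euclidean distance).
            dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=-1)
            nearest = dists.argmin(axis=1)
            # Move each centroid to the mean of its members; empty clusters stay put.
            for k in range(per_class):
                members = points[nearest == k]
                if len(members) > 0:
                    centroids[k] = members.mean(axis=0)
        prototypes[int(label)] = centroids
    return prototypes

# Stand-in data: 200 random 16-dimensional "latents" spread over 4 classes.
rng = np.random.default_rng(0)
latents = rng.normal(size=(200, 16))
labels = np.arange(200) % 4
protos = class_prototypes(latents, labels, per_class=5)
```

In this reading, `per_class` plays the role of the images-per-class budget: each centroid would later be decoded, together with its class label, into one synthetic image by the latent diffusion model. That decoding step is not shown here.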

Findings

01. Outperforms state-of-the-art methods in most metrics.

02. Demonstrates strong cross-architecture generalization.

03. Reduces computational costs for large-scale datasets.

Abstract

Dataset distillation offers a lightweight synthetic dataset for fast network training with promising test accuracy. To imitate the performance of the original dataset, most approaches employ bi-level optimization and the distillation space relies on the matching architecture. Nevertheless, these approaches either suffer significant computational costs on large-scale datasets or experience performance decline on cross-architectures. We advocate for designing an economical dataset distillation framework that is independent of the matching architectures. With empirical observations, we argue that constraining the consistency of the real and synthetic image spaces will enhance the cross-architecture generalization. Motivated by this, we introduce Dataset Distillation via Disentangled Diffusion Model (D$^4$M), an efficient framework for dataset distillation. Compared to…

Code & Models

Repositories

richards94/D4M (PyTorch)

Taxonomy

Topics: Neural Networks and Applications

Methods: Latent Diffusion Model · Diffusion