Enhancing Dataset Distillation via Label Inconsistency Elimination and   Learning Pattern Refinement

Chuhao Zhou; Chenxi Jiang; Yi Xie; Haozhi Cao; Jianfei Yang

arXiv:2410.13311·cs.CV·October 18, 2024

Enhancing Dataset Distillation via Label Inconsistency Elimination and Learning Pattern Refinement

Chuhao Zhou, Chenxi Jiang, Yi Xie, Haozhi Cao, Jianfei Yang

PDF

Open Access 1 Repo

TL;DR

This paper introduces M-DATM, an improved dataset distillation method that eliminates label inconsistency and refines learning patterns, achieving top results in the ECCV-2024 challenge.

Contribution

It proposes modifications to DATM by removing soft labels and reducing matching range, enhancing dataset distillation performance.

Findings

01

Achieved 0.4061 accuracy on CIFAR-100

02

Achieved 0.1831 accuracy on Tiny ImageNet

03

Ranked 1st in ECCV-2024 Data Distillation Challenge

Abstract

Dataset Distillation (DD) seeks to create a condensed dataset that, when used to train a model, enables the model to achieve performance similar to that of a model trained on the entire original dataset. It relieves the model training from processing massive data and thus reduces the computation resources, storage, and time costs. This paper illustrates our solution that ranks 1st in the ECCV-2024 Data Distillation Challenge (track 1). Our solution, Modified Difficulty-Aligned Trajectory Matching (M-DATM), introduces two key modifications to the original state-of-the-art method DATM: (1) the soft labels learned by DATM do not achieve one-to-one correspondence with the counterparts generated by the official evaluation script, so we remove the soft labels technique to alleviate such inconsistency; (2) since the removal of soft labels makes it harder for the synthetic dataset to learn late…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chuhaozhou99/m-datm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification