Dataset Distillation by Automatic Training Trajectories

Dai Liu; Jindong Gu; Hu Cao; Carsten Trinitis; Martin Schulz

arXiv:2407.14245·cs.CV·July 22, 2024

Dataset Distillation by Automatic Training Trajectories

Dai Liu, Jindong Gu, Hu Cao, Carsten Trinitis, Martin Schulz

PDF

Open Access 1 Repo

TL;DR

This paper introduces Automatic Training Trajectories (ATT), a novel dataset distillation method that adaptively adjusts training trajectory length to improve generalization and stability across different neural network architectures.

Contribution

The paper proposes ATT, an adaptive approach to dataset distillation that dynamically adjusts training trajectory length to mitigate overfitting and improve cross-architecture performance.

Findings

01

Outperforms existing methods in cross-architecture tests

02

Enhances stability against parameter variations

03

Addresses the Accumulated Mismatching Problem (AMP)

Abstract

Dataset Distillation is used to create a concise, yet informative, synthetic dataset that can replace the original dataset for training purposes. Some leading methods in this domain prioritize long-range matching, involving the unrolling of training trajectories with a fixed number of steps (NS) on the synthetic dataset to align with various expert training trajectories. However, traditional long-range matching methods possess an overfitting-like problem, the fixed step size NS forces synthetic dataset to distortedly conform seen expert training trajectories, resulting in a loss of generality-especially to those from unencountered architecture. We refer to this as the Accumulated Mismatching Problem (AMP), and propose a new approach, Automatic Training Trajectories (ATT), which dynamically and adaptively adjusts trajectory length NS to address the AMP. Our method outperforms existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

NiaLiu/ATT
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSimulation Techniques and Applications · Machine Learning and Data Classification

MethodsAdversarial Model Perturbation · ALIGN