Progressive trajectory matching for medical dataset distillation

Zhen Yu; Yang Liu; Qingchao Chen

arXiv:2403.13469·cs.CV·March 21, 2024·1 cites

Progressive trajectory matching for medical dataset distillation

Zhen Yu, Yang Liu, Qingchao Chen

PDF

Open Access

TL;DR

This paper introduces a progressive trajectory matching approach for medical dataset distillation, enhancing stability and diversity of synthetic datasets to improve model training while preserving privacy.

Contribution

It proposes a novel progressive trajectory matching strategy and a dynamic overlap mitigation module specifically designed for medical image dataset distillation, along with a new benchmark for evaluation.

Findings

01

Achieves 8.33% improvement over previous methods on average.

02

Improves performance by 11.7% when image per class is 2.

03

Enhances training stability and synthetic dataset diversity.

Abstract

It is essential but challenging to share medical image datasets due to privacy issues, which prohibit building foundation models and knowledge transfer. In this paper, we propose a novel dataset distillation method to condense the original medical image datasets into a synthetic one that preserves useful information for building an analysis model without accessing the original datasets. Existing methods tackle only natural images by randomly matching parts of the training trajectories of the model parameters trained by the whole real datasets. However, through extensive experiments on medical image datasets, the training process is extremely unstable and achieves inferior distillation results. To solve these barriers, we propose to design a novel progressive trajectory matching strategy to improve the training stability for medical image dataset distillation. Additionally, it is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTime Series Analysis and Forecasting