Dataset Distillation in Medical Imaging: A Feasibility Study
Muyang Li, Can Cui, Quan Liu, Ruining Deng, Tianyuan Yao, Marilyn, Lionts, Yuankai Huo

TL;DR
This study investigates the feasibility of applying data distillation techniques to medical imaging datasets, demonstrating that significant data reduction is possible without sacrificing model performance, thus enabling efficient and secure data sharing.
Contribution
It provides the first comprehensive evaluation of data distillation methods in medical imaging, showing their potential for reducing dataset size while maintaining accuracy.
Findings
Data distillation significantly reduces dataset size.
Small, representative samples can predict distillation success.
Distillation maintains model performance comparable to full datasets.
Abstract
Data sharing in the medical image analysis field has potential yet remains underappreciated. The aim is often to share datasets efficiently with other sites to train models effectively. One possible solution is to avoid transferring the entire dataset while still achieving similar model performance. Recent progress in data distillation within computer science offers promising prospects for sharing medical data efficiently without significantly compromising model effectiveness. However, it remains uncertain whether these methods would be applicable to medical imaging, since medical and natural images are distinct fields. Moreover, it is intriguing to consider what level of performance could be achieved with these methods. To answer these questions, we conduct investigations on a variety of leading data distillation methods, in different contexts of medical imaging. We evaluate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Artificial Intelligence in Healthcare
