Evolution-aware VAriance (EVA) Coreset Selection for Medical Image Classification
Yuxin Hong, Xiao Zhang, Xin Zhang, Joey Tianyi Zhou

TL;DR
This paper introduces EVA, a novel coreset selection method for medical image classification that effectively reduces data size while maintaining high accuracy, outperforming existing methods especially at high compression rates.
Contribution
EVA is a new coreset selection strategy that captures model evolution and sample importance fluctuations, tailored for medical imaging datasets.
Findings
EVA achieves 98.27% accuracy with only 10% training data.
EVA outperforms baseline methods and random selection at high compression rates.
EVA demonstrates significant potential for resource-efficient medical image analysis.
Abstract
In the medical field, managing high-dimensional massive medical imaging data and performing reliable medical analysis from it is a critical challenge, especially in resource-limited environments such as remote medical facilities and mobile devices. This necessitates effective dataset compression techniques to reduce storage, transmission, and computational cost. However, existing coreset selection methods are primarily designed for natural image datasets, and exhibit doubtful effectiveness when applied to medical image datasets due to challenges such as intra-class variation and inter-class similarity. In this paper, we propose a novel coreset selection strategy termed as Evolution-aware VAriance (EVA), which captures the evolutionary process of model training through a dual-window approach and reflects the fluctuation of sample importance more precisely through variance measurement.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
