Curriculum Fine-tuning of Vision Foundation Model for Medical Image Classification Under Label Noise
Yeonguk Yu, Minhwan Ko, Sungho Shin, Kangmin Kim, Kyoobin Lee

TL;DR
This paper introduces CUFIT, a curriculum fine-tuning approach for vision foundation models that effectively handles noisy labels in medical image classification, outperforming previous methods across multiple benchmarks.
Contribution
The paper proposes a novel curriculum fine-tuning paradigm leveraging pre-trained vision models and linear probing to improve medical image classification under label noise, a setting not previously addressed.
Findings
CUFIT surpasses previous methods by up to 5.8% at 40% noise rate.
The approach achieves higher label precision and recall in noisy label detection.
Experimental results demonstrate significant performance improvements across various datasets.
Abstract
Deep neural networks have demonstrated remarkable performance in various vision tasks, but their success heavily depends on the quality of the training data. Noisy labels are a critical issue in medical datasets and can significantly degrade model performance. Previous clean sample selection methods have not utilized the well pre-trained features of vision foundation models (VFMs) and assumed that training begins from scratch. In this paper, we propose CUFIT, a curriculum fine-tuning paradigm of VFMs for medical image classification under label noise. Our method is motivated by the fact that linear probing of VFMs is relatively unaffected by noisy samples, as it does not update the feature extractor of the VFM, thus robustly classifying the training samples. Subsequently, curriculum fine-tuning of two adapters is conducted, starting with clean sample selection from the linear probing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMedical Imaging and Analysis · Radiomics and Machine Learning in Medical Imaging
