T-DuMpRa: Teacher-guided Dual-path Multi-prototype Retrieval Augmented framework for fine-grained medical image classification
Zixuan Tang, Shen Zhao

TL;DR
T-DuMpRa is a framework that combines discriminative classification with multi-prototype retrieval, guided by a teacher model, to improve fine-grained medical image classification, especially in ambiguous cases.
Contribution
It introduces a plug-and-play dual-path retrieval-augmented framework with a teacher-guided prototype memory bank for better handling ambiguous medical images.
Findings
Improves accuracy on HAM10000 and ISIC2019 datasets.
Enhances model's ability to handle visually ambiguous cases.
Achieves up to 2.69% performance improvement across multiple backbones.
Abstract
Fine-grained medical image classification is challenged by subtle inter-class variations and visually ambiguous cases, where confidence estimates often exhibit uncertainty rather than being overconfident. In such scenarios, purely discriminative classifiers may achieve high overall accuracy yet still fail to distinguish between highly similar categories, leading to miscalibrated predictions. We propose T-DuMpRa, a teacher-guided dual-path multi-prototype retrieval-augmented framework, where discriminative classification and multi-prototype retrieval jointly drive both training and prediction. During training, we jointly optimize cross-entropy and supervised contrastive objectives to learn a cosine-compatible embedding geometry for reliable prototype matching. We further employ an exponential moving average (EMA) teacher to obtain smoother representations and build a multi-prototype…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
