CLIP-DR: Textual Knowledge-Guided Diabetic Retinopathy Grading with Ranking-aware Prompting
Qinkai Yu, Jianyang Xie, Anh Nguyen, He Zhao, Jiong Zhang, Huazhu Fu,, Yitian Zhao, Yalin Zheng, Yanda Meng

TL;DR
This paper introduces CLIP-DR, a novel framework leveraging pre-trained visual language models with ranking-aware prompting and similarity smoothing to improve diabetic retinopathy severity grading's robustness and accuracy.
Contribution
It proposes a ranking-aware prompting strategy and a similarity matrix smoothing module to enhance CLIP's performance for DR grading, addressing data variability and class imbalance.
Findings
Outperforms state-of-the-art methods on GDRBench benchmark.
Demonstrates robustness to data variability and class imbalance.
Achieves superior accuracy in DR severity classification.
Abstract
Diabetic retinopathy (DR) is a complication of diabetes and usually takes decades to reach sight-threatening levels. Accurate and robust detection of DR severity is critical for the timely management and treatment of diabetes. However, most current DR grading methods suffer from insufficient robustness to data variability (\textit{e.g.} colour fundus images), posing a significant difficulty for accurate and robust grading. In this work, we propose a novel DR grading framework CLIP-DR based on three observations: 1) Recent pre-trained visual language models, such as CLIP, showcase a notable capacity for generalisation across various downstream tasks, serving as effective baseline models. 2) The grading of image-text pairs for DR often adheres to a discernible natural sequence, yet most existing DR grading methods have primarily overlooked this aspect. 3) A long-tailed distribution among…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRetinal Imaging and Analysis · Imbalanced Data Classification Techniques · Artificial Intelligence in Healthcare
MethodsContrastive Language-Image Pre-training
