Diabetic Retinopathy Grading with CLIP-based Ranking-Aware Adaptation:A Comparative Study on Fundus Image

Sungjun Cho

arXiv:2603.13403·cs.CV·March 17, 2026

Diabetic Retinopathy Grading with CLIP-based Ranking-Aware Adaptation:A Comparative Study on Fundus Image

Sungjun Cho

PDF

Open Access

TL;DR

This study compares three CLIP-based methods for diabetic retinopathy grading, demonstrating that ranking-aware prompting and hybrid models significantly outperform zero-shot approaches in accuracy and clinical relevance.

Contribution

It introduces and evaluates three CLIP-based approaches, including a ranking-aware model, for multi-class DR severity grading on fundus images, highlighting their effectiveness.

Findings

01

Ranking-aware model achieves 93.42% accuracy and AUROC 0.9845.

02

Hybrid FCN-CLIP model achieves 92.49% accuracy and AUROC 0.99.

03

Both models outperform the zero-shot baseline by a large margin.

Abstract

Diabetic retinopathy (DR) is a leading cause of preventable blindness, and automated fundus image grading can play an important role in large-scale screening. In this work, we investigate three CLIP-based approaches for five-class DR severity grading: (1) a zero-shot baseline using prompt engineering, (2) a hybrid FCN-CLIP model augmented with CBAM attention, and (3) a ranking-aware prompting model that encodes the ordinal structure of DR progression. We train and evaluate on a combined dataset of APTOS 2019 and Messidor-2 (n=5,406), addressing class imbalance through resampling and class-specific optimal thresholding. Our experiments show that the ranking-aware model achieves the highest overall accuracy (93.42%, AUROC 0.9845) and strong recall on clinically critical severe cases, while the hybrid FCN-CLIP model (92.49%, AUROC 0.99) excels at detecting proliferative DR. Both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRetinal Imaging and Analysis · Retinal Diseases and Treatments · COVID-19 diagnosis using AI