Graph-enhanced multimodal fusion of vascular biomarkers and deep features for diabetic retinopathy detection
K. V. Deepsahith, Basineni Shashank, Bangipavan Kumar, Sherly Alphonse, Brindha Subburaj, Girish Subramanian

TL;DR
This paper introduces a new method for detecting diabetic retinopathy by combining deep learning features and vascular biomarkers using a transformer-based fusion model.
Contribution
The novel contribution is a graph-enhanced multimodal fusion framework using transformer cross-attention for diabetic retinopathy detection.
Findings
The model achieves 93.8% accuracy and 0.96 AUC-ROC on the Messidor-2 dataset.
It outperforms existing methods with above 98% accuracy on Eyepacs and APTOS 2019 datasets.
Abstract
Diabetic retinopathy (DR) detection can be performed through both deep retinal representations and vascular biomarkers. This proposed work suggests a multimodal framework that combines deep features with vascular descriptors in transformer fusion architecture. Fundus images are preprocessed using CLAHE, Canny edge detection, Top-hat transformation, and U-Net vessel segmentation. Then, the images are passed through a convolutional block attention module (CBAM)-fused enhanced MobileNetV3 backbone for deep spatial feature extraction. In parallel, the segmented vasculature is skeletonized to create a vascular graph, and the descriptors are computed using fractal dimension analysis (FDA), artery-to-vein ratio (AVR), and gray level co-occurrence matrix (GLCM) texture. A graph neural network (GNN) then generates a global topology-aware embedding using all that information. The different…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRetinal Imaging and Analysis · Retinal Diseases and Treatments · COVID-19 diagnosis using AI
