Comparative Analysis of Deep Learning Strategies for Hypertensive Retinopathy Detection from Fundus Images: From Scratch and Pre-trained Models
Yanqiao Zhu

TL;DR
This study compares deep learning methods for hypertensive retinopathy detection from fundus images, highlighting how data augmentation affects different architectures and emphasizing the importance of model choice and data diversity.
Contribution
It provides a detailed analysis of how architecture and data augmentation interact, offering insights into optimal strategies for medical image classification with limited data.
Findings
Augmentation boosts pure ViTs but degrades hybrid ViT-CNNs.
Smaller patch sizes improve fine-grained detail detection.
Self-supervised models like DINOv2 benefit from augmentation with limited data.
Abstract
This paper presents a comparative analysis of deep learning strategies for detecting hypertensive retinopathy from fundus images, a central task in the HRDC challenge~\cite{qian2025hrdc}. We investigate three distinct approaches: a custom CNN, a suite of pre-trained transformer-based models, and an AutoML solution. Our findings reveal a stark, architecture-dependent response to data augmentation. Augmentation significantly boosts the performance of pure Vision Transformers (ViTs), which we hypothesize is due to their weaker inductive biases, forcing them to learn robust spatial and structural features. Conversely, the same augmentation strategy degrades the performance of hybrid ViT-CNN models, whose stronger, pre-existing biases from the CNN component may be "confused" by the transformations. We show that smaller patch sizes (ViT-B/8) excel on augmented data, enhancing fine-grained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRetinal Imaging and Analysis · Retinal and Optic Conditions
