# Performance and Clinical Utility of Deep Learning for Detecting Referable Age-Related Macular Degeneration on Fundus Photographs: A Systematic Review and Meta-Analysis

**Authors:** Wei-Ting Luo, Ting-Wei Wang

PMC · DOI: 10.3390/diagnostics16040633 · 2026-02-22

## TL;DR

This paper reviews how well deep learning can detect serious age-related eye disease from photos and finds it performs well compared to human experts.

## Contribution

The study provides a systematic review and meta-analysis of deep learning's diagnostic accuracy for referable AMD detection using fundus photographs.

## Key findings

- DL algorithms showed high pooled sensitivity (0.91) and specificity (0.93) for detecting referable AMD.
- DL systems had higher specificity than human graders but slightly lower sensitivity.
- The pooled positive and negative likelihood ratios suggest strong diagnostic utility of DL.

## Abstract

Background/Objectives: Age-related macular degeneration (AMD) is a leading cause of irreversible central vision loss in older adults. Detection of referable AMD—typically intermediate or advanced disease requiring specialist evaluation—is critical for timely intervention. Deep learning (DL) applied to color fundus photographs has emerged as a potential tool to support large-scale AMD screening. This systematic review and meta-analysis evaluated the diagnostic accuracy of DL algorithms for detecting referable AMD and compared their performance with human graders. Methods: We systematically searched PubMed, Embase, Web of Science, and IEEE Xplore through 18 December 2025. Diagnostic accuracy studies assessing DL algorithms on color fundus photographs for referable AMD in adults were included. Two reviewers independently screened studies, extracted data, and assessed risk of bias using an AI-adapted PROBAST framework. Pooled sensitivity and specificity were estimated using a bivariate random-effects model. Clinical utility was evaluated using likelihood ratios, and paired head-to-head comparisons were synthesized using a contrast-based meta-analysis. Results: Fourteen studies were included. DL algorithms achieved a pooled sensitivity of 0.91 (95% CI: 0.86–0.94) and specificity of 0.93 (95% CI: 0.86–0.96), with substantial heterogeneity. The pooled positive and negative likelihood ratios were 12.22 and 0.10, respectively, indicating strong diagnostic utility. In direct comparisons, DL systems showed slightly lower sensitivity but higher specificity than human graders. Conclusions: Deep learning demonstrates high diagnostic accuracy for detecting referable AMD from fundus photographs and may support screening and referral workflows. Further prospective validation and standardized evaluation are needed before widespread clinical implementation.

## Linked entities

- **Diseases:** age-related macular degeneration (MONDO:0005150), AMD (MONDO:0005150)

## Full-text entities

- **Genes:** VEGFA (vascular endothelial growth factor A) [NCBI Gene 7422] {aka L-VEGF, MVCD1, VEGF, VPF}
- **Diseases:** WT (MESH:D009396), DR (MESH:D004370), Diabetic Retinopathy (MESH:D003930), Age-Related Eye Disease (MESH:D005128), glaucoma (MESH:D005901), retinal disease (MESH:D012164), Degeneration (MESH:D009410), AMD (MESH:D008268), diabetic (MESH:D003920), Drusen (MESH:D015593), vision loss (MESH:D014786), injury to (MESH:D014947), DL (MESH:D007859), Geographic Atrophy (MESH:D057092)
- **Chemicals:** fluorescein (MESH:D019793), DTA (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12939426/full.md

---
Source: https://tomesphere.com/paper/PMC12939426