Targeting vision-threatening disease in EyeArt-based diabetic retinopathy screening: A letter in response to: “Evaluation of the degree of agreement in the diagnosis of diabetic retinopathy between ophthalmologists and EyeArt”
Henry Bair

TL;DR
This letter argues that diabetic retinopathy screening should focus on preventing vision-threatening cases rather than all cases, to better balance accuracy and efficiency.
Contribution
The paper proposes prioritizing vision-threatening DR as the primary endpoint in screening programs and suggests integrating multimodal AI systems for better triage.
Findings
Current screening metrics optimized for any DR may not balance sensitivity, specificity, and referral burden effectively.
OCT-based risk factors support the development of AI systems combining imaging and clinical data to triage vtDR.
Multimodal AI could enable more efficient screening and personalized follow-up intervals.
Abstract
Guedes et al. demonstrate excellent agreement between EyeArt and ophthalmologist grading for any diabetic retinopathy (DR) within a regional screening program, with near-perfect binocular sensitivity and high specificity. Building on these findings, this letter argues that screening programs are primarily justified by the prevention of vision-threatening DR (vtDR), and that performance metrics should therefore prioritize vtDR rather than any DR as the primary endpoint. The letter highlights how the current operating point, optimized for any DR, may not reflect an optimal balance between sensitivity, specificity, and downstream referral burden in real-world workflows. It also discusses how the study’s OCT-based risk factors support the development of multimodal and risk-integrated AI systems that combine fundus imaging, OCT, and clinical variables to triage vtDR more efficiently and…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRetinal Imaging and Analysis · Retinal Diseases and Treatments · Retinal and Optic Conditions
To the Editor
Guedes et al. report excellent agreement between EyeArt and ophthalmologist grading for diabetic retinopathy (DR) in a regional screening program, with binocular sensitivity of 100% and specificity of 93.5% [1]. This strengthens the case for autonomous AI in DR screening. However, an aspect that warrants further exploration is the choice of “any DR” as the primary endpoint rather than vision‑threatening DR (vtDR).
From a health‑system perspective, screening is principally justified by prevention of blindness, and guidelines emphasize detection of vtDR (proliferative DR and/or centre‑involving diabetic macular oedema) [2]. Prior EyeArt evaluations have reported performance specifically for vtDR, often with sensitivity similar to that for any DR but with improved specificity when thresholds are tuned to this endpoint [3, 4]. In the present study, the near‑perfect sensitivity for any DR suggests that a modest relaxation of sensitivity in favor of higher specificity might be acceptable if the endpoint were vtDR, particularly in settings where referral capacity is constrained. Reporting EyeArt’s binocular sensitivity and specificity for vtDR in this cohort, and modelling expected referral volumes under different operating points, would therefore be highly informative.
The authors show that OCT‑defined diabetic macular edema and increased central retinal thickness are strongly associated with DR, and that these associations are recapitulated in AI‑based models. Yet EyeArt currently analyses only color fundus photographs. Taken together, these findings support development of multimodal AI systems that integrate fundus imaging with OCT (and potentially OCT‑angiography) to directly target vtDR rather than extrapolating from surrogate fundus features alone. Such systems could triage vtDR at the point of care, while fundus‑only AI remains an efficient front‑end option for clinics without OCT access.
Finally, the inverse association between age and DR after adjustment for diabetes duration hints at heterogeneous risk trajectories, consistent with prior epidemiologic data [2]. Incorporating age of onset and cumulative glycemic exposure into AI‑driven risk stratification, alongside image‑based outputs, might allow personalized screening intervals beyond a simple binary AI read‑out.
Re‑analysis of the Guedes et al. cohort using vtDR as the primary endpoint, and exploration of multimodal and risk‑integrated AI workflows, may provide readers with clinically actionable insight into how high‑performing algorithms such as EyeArt can be optimally deployed in real‑world screening programs.
