Feasibility of Multimodal Deep Learning for Automated Staging of Familial Exudative Vitreoretinopathy Using Color Fundus Photographs and Fluorescein Angiography
Mingzhen Yuan, Tianyu Wang, Zirong Liu, Jinghua Liu, Jing Ma, Guangda Deng, Liang Li, Songfeng Li, Yan Hu, Hai Lu

TL;DR
This paper explores using deep learning models to automatically stage a rare eye disease called FEVR using eye images and angiography.
Contribution
The study introduces a novel multimodal dataset and compares deep learning models for FEVR staging, showing that multimodal fusion outperforms single-modal approaches.
Findings
Transformers outperformed CNNs in single-modal analysis of FEVR staging.
CRD-Net achieved peak performance with AUC up to 0.94 in severe FEVR cases.
Multimodal deep learning models showed high specificity and accuracy comparable to specialists.
Abstract
Introduction: To evaluate the feasibility of multimodal deep learning (DL) for automated staging of familial exudative vitreoretinopathy (FEVR) using color fundus photographs (CFP) and fluorescein angiography (FFA). Methods: We assembled a multimodal dataset across FEVR stages 0–5 and post-laser cases and benchmarked CNNs (Convolutional Neural Networks), Transformers, and multimodal fusion under center-region and multi-image settings. Class imbalance was mitigated via weighted sampling and focal/class-balanced losses. We report accuracy, recall, precision, macro-F1, Cohen’s κ, and class-wise ROC/AUC with 95% Cis. Results: AI system showed balanced performance versus specialists (0.65 vs. Dr. A: 0.48/Dr. B: 0.48) in CFP assessment, maintaining high specificity (0.91–0.92). Among architectures: (1) Transformers outperformed CNNs in single-modal analysis; (2) ResNet showed moderate…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRetinal and Macular Surgery · Retinal Imaging and Analysis · Retinal Diseases and Treatments
