Native Intelligence Emerges from Large-Scale Clinical Practice: A Retinal Foundation Model with Deployment Efficiency
Jia Guo, Jiawei Du, Shengzhu Yang, Shuai Lu, Wenquan Cheng, Kaiwen Zhang, Yihua Sun, Chuhong Yang, Weihang Zhang, Fang Chen, Yilan Wu, Lie Ju, Guochen Ning, Longfei Ma, Huiping Yao, Jinyuan Wang, Peilun Shi, Yukun Zhou, Jie Xu, Pearse A. Keane, Hanruo Liu, Hongen Liao

TL;DR
This paper introduces ReVision, a retinal foundation model trained on real-world telemedicine data, demonstrating high accuracy and deployment efficiency across diverse clinical tasks without extensive task-specific training.
Contribution
The study shows that large-scale clinical practice data can be used to build effective retinal AI models, reducing the need for extensive annotation and fine-tuning.
Findings
Achieves zero-shot disease detection with AUROC of 0.946 and 0.952 on benchmarks and cohorts.
Matches fine-tuned models with fewer parameters and labeled data.
Improves ophthalmologists' diagnostic accuracy by 14.8% in a reader study.
Abstract
Current retinal foundation models remain constrained by curated research datasets that lack authentic clinical context, and require extensive task-specific optimization for each application, limiting their deployment efficiency in low-resource settings. Here, we show that these barriers can be overcome by building clinical native intelligence directly from real-world medical practice. Our key insight is that large-scale telemedicine programs, where expert centers provide remote consultations across distributed facilities, represent a natural reservoir for learning clinical image interpretation. We present ReVision, a retinal foundation model that learns from the natural alignment between 485,980 color fundus photographs and their corresponding diagnostic reports, accumulated through a decade-long telemedicine program spanning 162 medical institutions across China. Through extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRetinal Imaging and Analysis · EEG and Brain-Computer Interfaces · Telemedicine and Telehealth Implementation
