Training a high-performance retinal foundation model with half-the-data and 400 times less compute
Justin Engelmann, Miguel O. Bernabeu

TL;DR
This paper introduces RETFound-Green, a retinal foundation model trained with significantly less data and compute, achieving high performance and efficiency across multiple downstream tasks and datasets.
Contribution
The authors propose a novel Token Reconstruction training objective enabling high-performance retinal models trained on minimal data and resources, reducing costs and environmental impact.
Findings
RETFound-Green performs best on 68 out of 119 tasks.
It can be trained for less than $100, a substantial reduction from previous models.
RETFound-Green is 14 times faster to download and 2.7 times faster in embedding computation.
Abstract
Artificial Intelligence in medicine is traditionally limited by the lack of massive training datasets. Foundation models, pre-trained models that can be adapted to downstream tasks with small datasets, could alleviate this problem. Researchers at Moorfields Eye Hospital (MEH) proposed RETFound-MEH, a retinal foundation model trained on 900,000 images, including private hospital data. Recently, data-efficient DERETFound was proposed providing comparable performance while being trained on only 150,000 publicly available images. However, both these models required very substantial resources to train initially and are resource-intensive in downstream use. We propose a novel Token Reconstruction objective that we use to train RETFound-Green, a retinal foundation model trained using only 75,000 publicly available images and 400 times less compute. We estimate the cost of training RETFound-MEH…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRetinal Imaging and Analysis · Retinal and Macular Surgery · Medical Image Segmentation Techniques
