Training a high-performance retinal foundation model with half-the-data   and 400 times less compute

Justin Engelmann; Miguel O. Bernabeu

arXiv:2405.00117·cs.CV·September 24, 2024

Training a high-performance retinal foundation model with half-the-data and 400 times less compute

Justin Engelmann, Miguel O. Bernabeu

PDF

Open Access

TL;DR

This paper introduces RETFound-Green, a retinal foundation model trained with significantly less data and compute, achieving high performance and efficiency across multiple downstream tasks and datasets.

Contribution

The authors propose a novel Token Reconstruction training objective enabling high-performance retinal models trained on minimal data and resources, reducing costs and environmental impact.

Findings

01

RETFound-Green performs best on 68 out of 119 tasks.

02

It can be trained for less than $100, a substantial reduction from previous models.

03

RETFound-Green is 14 times faster to download and 2.7 times faster in embedding computation.

Abstract

Artificial Intelligence in medicine is traditionally limited by the lack of massive training datasets. Foundation models, pre-trained models that can be adapted to downstream tasks with small datasets, could alleviate this problem. Researchers at Moorfields Eye Hospital (MEH) proposed RETFound-MEH, a retinal foundation model trained on 900,000 images, including private hospital data. Recently, data-efficient DERETFound was proposed providing comparable performance while being trained on only 150,000 publicly available images. However, both these models required very substantial resources to train initially and are resource-intensive in downstream use. We propose a novel Token Reconstruction objective that we use to train RETFound-Green, a retinal foundation model trained using only 75,000 publicly available images and 400 times less compute. We estimate the cost of training RETFound-MEH…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRetinal Imaging and Analysis · Retinal and Macular Surgery · Medical Image Segmentation Techniques