RetFiner: A Vision-Language Refinement Scheme for Retinal Foundation Models
Ronald Fecso, Jos\'e Morano, Ursula Schmidt-Erfurth, Hrvoje Bogunovi\'c

TL;DR
RetFiner is a novel vision-language refinement scheme that enhances retinal foundation models trained on OCT images by leveraging textual data, leading to improved performance on diverse retinal disease classification tasks.
Contribution
It introduces a new SSL vision-language refinement method that boosts existing retinal foundation models' semantic understanding and downstream task performance.
Findings
Significant improvements in linear probing accuracy on OCT classification tasks.
Average performance increase of 5.8, 3.9, and 2.1 percentage points over baselines.
RetFiner enables direct adaptation of models to specific populations.
Abstract
The rise of imaging techniques such as optical coherence tomography (OCT) and advances in deep learning (DL) have enabled clinicians and researchers to streamline retinal disease staging. A popular DL approach is self-supervised learning (SSL), where models learn from vast amounts of unlabeled data, avoiding costly annotation. SSL has allowed the development of foundation models (FMs), large models that can be used for a variety of downstream tasks. However, existing FMs for OCT, trained solely on image data, lack a comprehensive and robust semantic understanding of images, as evidenced by their downstream performance (especially for complex tasks), and thus require supervised fine-tuning (which may be unfeasible) to better adapt to specific applications and populations. To address this, we propose RetFiner, an SSL vision-language refinement scheme that improves the representations of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRetinal Imaging and Analysis · Domain Adaptation and Few-Shot Learning · Retinal Diseases and Treatments
MethodsSparse Evolutionary Training
