A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert   Knowledge in Text Supervision

Julio Silva-Rodr\'iguez; Hadi Chakor; Riadh Kobbi; Jose Dolz and; Ismail Ben Ayed

arXiv:2308.07898·cs.CV·January 16, 2025

A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert Knowledge in Text Supervision

Julio Silva-Rodr\'iguez, Hadi Chakor, Riadh Kobbi, Jose Dolz and, Ismail Ben Ayed

PDF

Open Access 1 Repo 2 Models

TL;DR

FLAIR is a vision-language model for retinal imaging that incorporates expert clinical knowledge through textual prompts, significantly improving generalization and performance in medical imaging tasks, especially under domain shifts and limited data.

Contribution

The paper introduces FLAIR, a novel pre-trained vision-language model that embeds expert knowledge via textual prompts, enhancing medical image understanding beyond existing models.

Findings

01

FLAIR outperforms dataset-focused models in few-shot scenarios.

02

Incorporating expert knowledge improves zero-shot generalization.

03

FLAIR surpasses larger generalist models and self-supervised networks in retinal tasks.

Abstract

Foundation vision-language models are currently transforming computer vision, and are on the rise in medical imaging fueled by their very promising generalization capabilities. However, the initial attempts to transfer this new paradigm to medical imaging have shown less impressive performances than those observed in other domains, due to the significant domain shift and the complex, expert domain knowledge inherent to medical-imaging tasks. Motivated by the need for domain-expert foundation models, we present FLAIR, a pre-trained vision-language model for universal retinal fundus image understanding. To this end, we compiled 38 open-access, mostly categorical fundus imaging datasets from various sources, with up to 101 different target conditions and 288,307 images. We integrate the expert's domain knowledge in the form of descriptive textual prompts, during both pre-training and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jusiro/flair
pytorchOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Retinal Imaging and Analysis · Cerebral Venous Sinus Thrombosis