MIRAGE: Multimodal foundation model and benchmark for comprehensive retinal OCT image analysis

Jos\'e Morano; Botond Fazekas; Emese S\"ukei; Ronald Fecso; Taha Emre; Markus Gumpinger; Georg Faustmann; Marzieh Oghbaie; Ursula Schmidt-Erfurth; Hrvoje Bogunovi\'c

arXiv:2506.08900·cs.CV·September 30, 2025

MIRAGE: Multimodal foundation model and benchmark for comprehensive retinal OCT image analysis

Jos\'e Morano, Botond Fazekas, Emese S\"ukei, Ronald Fecso, Taha Emre, Markus Gumpinger, Georg Faustmann, Marzieh Oghbaie, Ursula Schmidt-Erfurth, Hrvoje Bogunovi\'c

PDF

1 Repo 2 Models

TL;DR

MIRAGE is a new multimodal foundation model and benchmark that significantly improves retinal OCT image analysis by integrating OCT and SLO data, outperforming existing models in classification and segmentation tasks.

Contribution

This work introduces MIRAGE, a novel multimodal foundation model for OCT and SLO images, along with a comprehensive benchmark for evaluation, addressing limitations of existing models.

Findings

01

MIRAGE outperforms general and specialized FMs in classification tasks.

02

MIRAGE achieves superior segmentation performance on retinal OCT images.

03

The benchmark facilitates robust evaluation of AI models in ophthalmology.

Abstract

Artificial intelligence (AI) has become a fundamental tool for assisting clinicians in analyzing ophthalmic images, such as optical coherence tomography (OCT). However, developing AI models often requires extensive annotation, and existing models tend to underperform on independent, unseen data. Foundation models (FMs), large AI models trained on vast unlabeled datasets, have shown promise in overcoming these challenges. Nonetheless, available FMs for ophthalmology lack extensive validation, especially for segmentation tasks, and focus on a single imaging modality. In this context, we propose MIRAGE, a novel multimodal FM for the analysis of OCT and scanning laser ophthalmoscopy (SLO) images. Additionally, we propose a new evaluation benchmark with OCT/SLO classification and segmentation tasks. The comparison with general and specialized FMs and segmentation methods shows the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

j-morano/mirage
pytorchOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus