Conquering the Retina: Bringing Visual in-Context Learning to OCT

Alessio Negrini; Simon Rei{\ss}

arXiv:2506.15200·cs.CV·June 19, 2025

Conquering the Retina: Bringing Visual in-Context Learning to OCT

Alessio Negrini, Simon Rei{\ss}

PDF

Open Access 1 Repo

TL;DR

This paper explores training generalist models for retinal OCT using visual in-context learning, enabling flexible task adaptation without extensive retraining, and provides a comprehensive evaluation protocol and baseline results.

Contribution

It introduces a novel application of visual in-context learning to retinal OCT, along with an evaluation protocol and baseline results for future research.

Findings

01

VICL can generalize across multiple OCT tasks

02

Baseline results highlight current limitations of VICL in OCT

03

Open-source code facilitates further research

Abstract

Recent advancements in medical image analysis have led to the development of highly specialized models tailored to specific clinical tasks. These models have demonstrated exceptional performance and remain a crucial research direction. Yet, their applicability is limited to predefined tasks, requiring expertise and extensive resources for development and adaptation. In contrast, generalist models offer a different form of utility: allowing medical practitioners to define tasks on the fly without the need for task-specific model development. In this work, we explore how to train generalist models for the domain of retinal optical coherence tomography using visual in-context learning (VICL), i.e., training models to generalize across tasks based on a few examples provided at inference time. To facilitate rigorous assessment, we propose a broad evaluation protocol tailored to VICL in OCT.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

negralessio/thesis-visual-in-context-learning
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRetinal Imaging and Analysis