Conquering the Retina: Bringing Visual in-Context Learning to OCT
Alessio Negrini, Simon Rei{\ss}

TL;DR
This paper explores training generalist models for retinal OCT using visual in-context learning, enabling flexible task adaptation without extensive retraining, and provides a comprehensive evaluation protocol and baseline results.
Contribution
It introduces a novel application of visual in-context learning to retinal OCT, along with an evaluation protocol and baseline results for future research.
Findings
VICL can generalize across multiple OCT tasks
Baseline results highlight current limitations of VICL in OCT
Open-source code facilitates further research
Abstract
Recent advancements in medical image analysis have led to the development of highly specialized models tailored to specific clinical tasks. These models have demonstrated exceptional performance and remain a crucial research direction. Yet, their applicability is limited to predefined tasks, requiring expertise and extensive resources for development and adaptation. In contrast, generalist models offer a different form of utility: allowing medical practitioners to define tasks on the fly without the need for task-specific model development. In this work, we explore how to train generalist models for the domain of retinal optical coherence tomography using visual in-context learning (VICL), i.e., training models to generalize across tasks based on a few examples provided at inference time. To facilitate rigorous assessment, we propose a broad evaluation protocol tailored to VICL in OCT.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRetinal Imaging and Analysis
