Representation learning from OCT images
Hedi Tabia, D\'esir\'e Sidib\'e, Nawres Khlifa, Ahmed Tabia, Ines Rahmany, Noura Aboudi, Zainab Haddad, Hajer Khachnaoui, Hsouna Zgolli

TL;DR
This survey reviews recent advances in representation learning for OCT retinal images, covering deep learning, foundation models, and multimodal approaches to improve diagnosis and reduce annotation reliance.
Contribution
It provides a structured taxonomy of methods, analyzes core contributions, and discusses open research directions in OCT image representation learning.
Findings
Comprehensive overview of OCT representation learning methods.
Analysis of core methodological contributions and limitations.
Identification of key open research challenges.
Abstract
Optical Coherence Tomography (OCT) has become one of the most used imaging modality in ophthalmology. It provides high-resolution, non-invasive visualization of retinal microarchitecture. The automated analysis of OCT images through representation learning has emerged as a central research frontier. This has mainly been driven by the clinical need to process large acquisition volumes. The objective is to reduce the reliance on expert annotation, and improve diagnostic consistency across devices and populations. This survey provides a comprehensive and structured review of representation learning methods for retinal OCT image analysis. It covers the period from early deep learning approaches to the most recent developments in foundation models and vision-language systems. We organize the literature along a principled taxonomy of learning paradigms, encompassing supervised learning with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
