Multi-OCT-SelfNet: Integrating Self-Supervised Learning with   Multi-Source Data Fusion for Enhanced Multi-Class Retinal Disease   Classification

Fatema-E- Jannat; Sina Gholami; Jennifer I. Lim; Theodore Leng; Minhaj; Nur Alam; and Hamed Tabkhi

arXiv:2409.11375·cs.CV·September 18, 2024·2 cites

Multi-OCT-SelfNet: Integrating Self-Supervised Learning with Multi-Source Data Fusion for Enhanced Multi-Class Retinal Disease Classification

Fatema-E- Jannat, Sina Gholami, Jennifer I. Lim, Theodore Leng, Minhaj, Nur Alam, and Hamed Tabkhi

PDF

Open Access

TL;DR

This paper introduces Multi-OCT-SelfNet, a self-supervised learning framework that fuses multi-source data to improve retinal disease classification from OCT images, especially under limited data conditions.

Contribution

It presents a novel multi-modal data fusion approach combined with self-supervised pre-training using LLMs and SwinV2, enhancing generalization in retinal disease diagnosis.

Findings

01

Consistent performance across three datasets

02

Superior generalization compared to ResNet-50

03

Effective under low data availability conditions

Abstract

In the medical domain, acquiring large datasets poses significant challenges due to privacy concerns. Nonetheless, the development of a robust deep-learning model for retinal disease diagnosis necessitates a substantial dataset for training. The capacity to generalize effectively on smaller datasets remains a persistent challenge. The scarcity of data presents a significant barrier to the practical implementation of scalable medical AI solutions. To address this issue, we've combined a wide range of data sources to improve performance and generalization to new data by giving it a deeper understanding of the data representation from multi-modal datasets and developed a self-supervised framework based on large language models (LLMs), SwinV2 to gain a deeper understanding of multi-modal dataset representations, enhancing the model's ability to extrapolate to new data for the detection of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRetinal Imaging and Analysis