Robust Incomplete-Modality Alignment for Ophthalmic Disease Grading and Diagnosis via Labeled Optimal Transport

Qinkai Yu; Jianyang Xie; Yitian Zhao; Cheng Chen; Lijun Zhang; Liming Chen; Jun Cheng; Lu Liu; Yalin Zheng; Yanda Meng

arXiv:2507.04999·cs.CV·July 8, 2025

Robust Incomplete-Modality Alignment for Ophthalmic Disease Grading and Diagnosis via Labeled Optimal Transport

Qinkai Yu, Jianyang Xie, Yitian Zhao, Cheng Chen, Lijun Zhang, Liming Chen, Jun Cheng, Lu Liu, Yalin Zheng, Yanda Meng

PDF

TL;DR

This paper introduces a robust multimodal alignment framework using optimal transport to improve ophthalmic disease diagnosis with incomplete multimodal data, outperforming existing methods in accuracy and robustness.

Contribution

It proposes a novel optimal transport-based alignment and fusion framework that effectively handles missing modalities in ophthalmic diagnostics, addressing limitations of imputation and distillation methods.

Findings

01

Achieves state-of-the-art performance on multiple ophthalmic datasets.

02

Effectively handles various scenarios of incomplete multimodal data.

03

Demonstrates robustness and superior accuracy compared to existing methods.

Abstract

Multimodal ophthalmic imaging-based diagnosis integrates color fundus image with optical coherence tomography (OCT) to provide a comprehensive view of ocular pathologies. However, the uneven global distribution of healthcare resources often results in real-world clinical scenarios encountering incomplete multimodal data, which significantly compromises diagnostic accuracy. Existing commonly used pipelines, such as modality imputation and distillation methods, face notable limitations: 1)Imputation methods struggle with accurately reconstructing key lesion features, since OCT lesions are localized, while fundus images vary in style. 2)distillation methods rely heavily on fully paired multimodal training data. To address these challenges, we propose a novel multimodal alignment and fusion framework capable of robustly handling missing modalities in the task of ophthalmic diagnostics. By…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.