Joint Segmentation and Grading with Iterative Optimization for Multimodal Glaucoma Diagnosis
Zhiwei Wang, Yuxing Li, Meilu Zhu, Defeng He, Edmund Y. Lam

TL;DR
This paper introduces an iterative multimodal optimization model that combines fundus and OCT data with advanced feature alignment and refinement techniques to improve glaucoma diagnosis accuracy.
Contribution
It presents a novel joint segmentation and grading framework using mid-level fusion and diffusion-based refinement for multimodal glaucoma assessment.
Findings
Effective multimodal feature integration demonstrated
Improved segmentation of optic disc and cup
Accurate glaucoma grading achieved
Abstract
Accurate diagnosis of glaucoma is challenging, as early-stage changes are subtle and often lack clear structural or appearance cues. Most existing approaches rely on a single modality, such as fundus or optical coherence tomography (OCT), capturing only partial pathological information and often missing early disease progression. In this paper, we propose an iterative multimodal optimization model (IMO) for joint segmentation and grading. IMO integrates fundus and OCT features through a mid-level fusion strategy, enhanced by a cross-modal feature alignment (CMFA) module to reduce modality discrepancies. An iterative refinement decoder progressively optimizes the multimodal features through a denoising diffusion mechanism, enabling fine-grained segmentation of the optic disc and cup while supporting accurate glaucoma grading. Extensive experiments show that our method effectively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRetinal Imaging and Analysis · Glaucoma and retinal disorders · AI in cancer detection
