GLEAM: A Multimodal Imaging Dataset and HAMM for Glaucoma Classification

Jiao Wang; Chi Liu; Yiying Zhang; Hongchen Luo; Zhifen Guo; Ying Hu; Ke Xu; Jing Zhou; Hongyan Xu; Ruiting Zhou; Man Tang

arXiv:2603.12800·eess.IV·May 12, 2026

GLEAM: A Multimodal Imaging Dataset and HAMM for Glaucoma Classification

Jiao Wang, Chi Liu, Yiying Zhang, Hongchen Luo, Zhifen Guo, Ying Hu, Ke Xu, Jing Zhou, Hongyan Xu, Ruiting Zhou, Man Tang

PDF

TL;DR

This paper introduces GLEAM, a comprehensive multimodal glaucoma dataset, and HAMM, a novel hierarchical attentive modeling framework for improved disease classification using diverse imaging modalities.

Contribution

The paper presents the first publicly available tri-modal glaucoma dataset and a new hierarchical attentive masked modeling approach for multimodal disease classification.

Findings

01

GLEAM enables effective multimodal analysis for glaucoma diagnosis.

02

HAMM improves cross-modal feature integration for classification accuracy.

03

The framework facilitates accurate glaucoma staging across multiple disease stages.

Abstract

We propose glaucoma lesion evaluation and analysis with multimodal imaging (GLEAM), the first publicly available tri-modal glaucoma dataset comprising scanning laser ophthalmoscopy fundus images, circumpapillary OCT images, and visual field pattern deviation maps, annotated with four disease stages, enabling effective exploitation of multimodal complementary information and facilitating accurate diagnosis and treatment across disease stages. To effectively integrate cross-modal information, we propose hierarchical attentive masked modeling (HAMM) for multimodal glaucoma classification. Our framework employs hierarchical attentive encoders and light decoders to focus cross-modal representation learning on the encoder.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.