Multi-scale Spatio-temporal Transformer-based Imbalanced Longitudinal   Learning for Glaucoma Forecasting from Irregular Time Series Images

Xikai Yang; Jian Wu; Xi Wang; Yuchen Yuan; Ning Li Wang; Pheng-Ann; Heng

arXiv:2402.13475·cs.CV·February 22, 2024·1 cites

Multi-scale Spatio-temporal Transformer-based Imbalanced Longitudinal Learning for Glaucoma Forecasting from Irregular Time Series Images

Xikai Yang, Jian Wu, Xi Wang, Yuchen Yuan, Ning Li Wang, Pheng-Ann, Heng

PDF

Open Access

TL;DR

This paper introduces MST-former, a multi-scale spatio-temporal transformer model designed for irregular, imbalanced longitudinal eye image data to improve glaucoma forecasting accuracy.

Contribution

The study proposes a novel transformer architecture with multi-scale features and a time-aware attention mechanism tailored for irregular, imbalanced longitudinal medical imaging data.

Findings

01

Achieved 98.6% AUC on glaucoma forecasting

02

Demonstrated strong generalization on Alzheimer's MRI data

03

Outperformed existing methods significantly

Abstract

Glaucoma is one of the major eye diseases that leads to progressive optic nerve fiber damage and irreversible blindness, afflicting millions of individuals. Glaucoma forecast is a good solution to early screening and intervention of potential patients, which is helpful to prevent further deterioration of the disease. It leverages a series of historical fundus images of an eye and forecasts the likelihood of glaucoma occurrence in the future. However, the irregular sampling nature and the imbalanced class distribution are two challenges in the development of disease forecasting approaches. To this end, we introduce the Multi-scale Spatio-temporal Transformer Network (MST-former) based on the transformer architecture tailored for sequential image inputs, which can effectively learn representative semantic information from sequential images on both temporal and spatial dimensions.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRetinal Imaging and Analysis · Glaucoma and retinal disorders

MethodsLinear Layer · Dense Connections · Label Smoothing · Adam · Attention Is All You Need · Softmax · Multi-Head Attention · Layer Normalization · Dropout · Residual Connection