UT-Net: Combining U-Net and Transformer for Joint Optic Disc and Cup Segmentation and Glaucoma Detection
Rukhshanda Hussain, Hritam Basak

TL;DR
UT-Net combines U-Net and transformer architectures with attention mechanisms for accurate joint segmentation of optic disc and cup, improving glaucoma detection from retinal images.
Contribution
The paper introduces UT-Net, a novel segmentation framework integrating U-Net and transformer with multi-level attention and an enhanced loss for better glaucoma-related structure segmentation.
Findings
UT-Net outperforms state-of-the-art methods on three datasets.
The model achieves higher accuracy in optic disc and cup segmentation.
Improved glaucoma detection based on precise CDR measurement.
Abstract
Glaucoma is a chronic visual disease that may cause permanent irreversible blindness. Measurement of the cup-to-disc ratio (CDR) plays a pivotal role in the detection of glaucoma in its early stage, preventing visual disparities. Therefore, accurate and automatic segmentation of optic disc (OD) and optic cup (OC) from retinal fundus images is a fundamental requirement. Existing CNN-based segmentation frameworks resort to building deep encoders with aggressive downsampling layers, which suffer from a general limitation on modeling explicit long-range dependency. To this end, in this paper, we propose a new segmentation pipeline, called UT-Net, availing the advantages of U-Net and transformer both in its encoding layer, followed by an attention-gated bilinear fusion scheme. In addition to this, we incorporate Multi-Head Contextual attention to enhance the regular self-attention used in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRetinal Imaging and Analysis · Brain Tumor Detection and Classification · Glaucoma and retinal disorders
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Max Pooling · Convolution · U-Net
