(M)SLAe-Net: Multi-Scale Multi-Level Attention embedded Network for Retinal Vessel Segmentation
Shreshth Saini, Geetika Agrawal

TL;DR
This paper introduces (M)SLAe-Net, a multi-scale, multi-level attention CNN for retinal vessel segmentation that effectively captures local and global features, improving accuracy and robustness across multiple datasets.
Contribution
The paper presents a novel multi-scale, multi-level attention network with a dynamic dilated pyramid pooling module and a specialized attention mechanism, reducing the need for multi-stage processing.
Findings
Outperforms existing methods on DRIVE, STARE, HRF, and CHASE-DB1 datasets.
Effectively segments vessels with varying shapes and sizes.
Demonstrates improved cross-dataset generalization.
Abstract
Segmentation plays a crucial role in diagnosis. Studying the retinal vasculatures from fundus images help identify early signs of many crucial illnesses such as diabetic retinopathy. Due to the varying shape, size, and patterns of retinal vessels, along with artefacts and noises in fundus images, no one-stage method can accurately segment retinal vessels. In this work, we propose a multi-scale, multi-level attention embedded CNN architecture ((M)SLAe-Net) to address the issue of multi-stage processing for robust and precise segmentation of retinal vessels. We do this by extracting features at multiple scales and multiple levels of the network, enabling our model to holistically extracts the local and global features. Multi-scale features are extracted using our novel dynamic dilated pyramid pooling (D-DPP) module. We also aggregate the features from all the network levels. These…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
