TL;DR
This paper introduces Att-SwinU-Net, an advanced skin lesion segmentation model that integrates cross-contextual attention into the Swin U-Net architecture to improve long-range dependency capture and feature re-usability.
Contribution
It proposes an attention-based extension to Swin U-Net that enhances skip connections with attention mechanisms, improving segmentation performance in medical images.
Findings
Attention mechanism improves segmentation accuracy
Enhanced feature re-usability in skip connections
Effective across multiple skin lesion datasets
Abstract
Melanoma is caused by the abnormal growth of melanocytes in human skin. Like other cancers, this life-threatening skin cancer can be treated with early diagnosis. To support a diagnosis by automatic skin lesion segmentation, several Fully Convolutional Network (FCN) approaches, specifically the U-Net architecture, have been proposed. The U-Net model with a symmetrical architecture has exhibited superior performance in the segmentation task. However, the locality restriction of the convolutional operation incorporated in the U-Net architecture limits its performance in capturing long-range dependency, which is crucial for the segmentation task in medical images. To address this limitation, recently a Transformer based U-Net architecture that replaces the CNN blocks with the Swin Transformer module has been proposed to capture both local and global representation. In this paper, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · Position-Wise Feed-Forward Layer · Label Smoothing · Linear Layer · Max Pooling · Absolute Position Encodings · Layer Normalization
