Assessing the Impact of Attention and Self-Attention Mechanisms on the Classification of Skin Lesions
Rafael Pedro, Arlindo L. Oliveira

TL;DR
This paper compares various attention and self-attention mechanisms in CNNs for skin lesion classification, finding that self-attention consistently improves performance more than attention modules.
Contribution
It provides an objective comparison of attention mechanisms in CNNs for skin lesion classification, highlighting the superior and consistent benefits of self-attention.
Findings
Attention modules sometimes improve CNN performance but lack consistency.
Self-attention mechanisms yield consistent and significant performance gains.
Self-attention achieves the best results with fewer parameters.
Abstract
Attention mechanisms have raised significant interest in the research community, since they promise significant improvements in the performance of neural network architectures. However, in any specific problem, we still lack a principled way to choose specific mechanisms and hyper-parameters that lead to guaranteed improvements. More recently, self-attention has been proposed and widely used in transformer-like architectures, leading to significant breakthroughs in some applications. In this work we focus on two forms of attention mechanisms: attention modules and self-attention. Attention modules are used to reweight the features of each layer input tensor. Different modules have different ways to perform this reweighting in fully connected or convolutional layers. The attention models studied are completely modular and in this work they will be used with the popular ResNet…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCutaneous Melanoma Detection and Management · AI in cancer detection
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · Residual Connection · 1x1 Convolution · Global Average Pooling · Batch Normalization · Max Pooling · Bottleneck Residual Block · Residual Block · Kaiming Initialization
