Self-Attention Capsule Networks for Object Classification
Assaf Hoogi, Brian Wilcox, Yachee Gupta, Daniel L. Rubin

TL;DR
This paper introduces Self-Attention Capsule Networks (SACN), a novel architecture that integrates self-attention mechanisms into capsule networks to enhance object classification accuracy and robustness, especially on complex datasets.
Contribution
The paper presents the first integration of self-attention within capsule networks, improving their ability to analyze complex data and reducing computational load.
Findings
Outperformed baseline CapsNet, ResNet-18, and DenseNet-40 in accuracy.
Achieved better classification on diverse datasets, including medical images.
Enhanced robustness and focus on salient features.
Abstract
We propose a novel architecture for object classification, called Self-Attention Capsule Networks (SACN). SACN is the first model that incorporates the Self-Attention mechanism as an integral layer within the Capsule Network (CapsNet). While the Self-Attention mechanism supplies a long-range dependencies, results in selecting the more dominant image regions to focus on, the CapsNet analyzes the relevant features and their spatial correlations inside these regions only. The features are extracted in the convolutional layer. Then, the Self-Attention layer learns to suppress irrelevant regions based on features analysis and highlights salient features useful for a specific task. The attention map is then fed into the CapsNet primary layer that is followed by a classification layer. The proposed SACN model was designed to solve two main limitations of the baseline CapsNet - analysis of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · Domain Adaptation and Few-Shot Learning
MethodsCapsule Network
