TL;DR
This paper introduces Efficient-CapsNet, a capsule network architecture that uses a novel self-attention routing algorithm, achieving state-of-the-art results with significantly fewer parameters and enhanced efficiency in visual representation learning.
Contribution
The paper proposes a new non-iterative, parallelizable routing algorithm and demonstrates that a highly compact capsule network can outperform larger models in efficiency and accuracy.
Findings
Achieves state-of-the-art results with only 2% of original CapsNet parameters.
Introduces a novel self-attention routing algorithm for capsule networks.
Demonstrates improved efficiency and generalization in visual tasks.
Abstract
Deep convolutional neural networks, assisted by architectural design strategies, make extensive use of data augmentation techniques and layers with a high number of feature maps to embed object transformations. That is highly inefficient and for large datasets implies a massive redundancy of features detectors. Even though capsules networks are still in their infancy, they constitute a promising solution to extend current convolutional networks and endow artificial visual perception with a process to encode more efficiently all feature affine transformations. Indeed, a properly working capsule network should theoretically achieve higher results with a considerably lower number of parameters count due to intrinsic capability to generalize to novel viewpoints. Nevertheless, little attention has been given to this relevant aspect. In this paper, we investigate the efficiency of capsule…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsCapsule Network
