Efficient-CapsNet: Capsule Network with Self-Attention Routing

Vittorio Mazzia; Francesco Salvetti; Marcello Chiaberge

arXiv:2101.12491·cs.CV·December 21, 2021

Efficient-CapsNet: Capsule Network with Self-Attention Routing

Vittorio Mazzia, Francesco Salvetti, Marcello Chiaberge

PDF

2 Repos

TL;DR

This paper introduces Efficient-CapsNet, a capsule network architecture that uses a novel self-attention routing algorithm, achieving state-of-the-art results with significantly fewer parameters and enhanced efficiency in visual representation learning.

Contribution

The paper proposes a new non-iterative, parallelizable routing algorithm and demonstrates that a highly compact capsule network can outperform larger models in efficiency and accuracy.

Findings

01

Achieves state-of-the-art results with only 2% of original CapsNet parameters.

02

Introduces a novel self-attention routing algorithm for capsule networks.

03

Demonstrates improved efficiency and generalization in visual tasks.

Abstract

Deep convolutional neural networks, assisted by architectural design strategies, make extensive use of data augmentation techniques and layers with a high number of feature maps to embed object transformations. That is highly inefficient and for large datasets implies a massive redundancy of features detectors. Even though capsules networks are still in their infancy, they constitute a promising solution to extend current convolutional networks and endow artificial visual perception with a process to encode more efficiently all feature affine transformations. Indeed, a properly working capsule network should theoretically achieve higher results with a considerably lower number of parameters count due to intrinsic capability to generalize to novel viewpoints. Nevertheless, little attention has been given to this relevant aspect. In this paper, we investigate the efficiency of capsule…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsCapsule Network