Capsule Network Projectors are Equivariant and Invariant Learners
Miles Everett, Aiden Durrant, Mingjun Zhong, and Georgios Leontidis

TL;DR
This paper introduces CapsIE, a novel self-supervised architecture using Capsule Networks to learn invariant and equivariant representations, achieving state-of-the-art results on 3D rotation tasks with improved efficiency.
Contribution
The work presents a new CapsNet-based architecture with a specialized entropy minimization objective for invariant-equivariant learning, outperforming prior methods on equivariant tasks.
Findings
CapsIE achieves state-of-the-art on 3DIEBench rotation tasks.
CapsNets effectively learn complex, generalised representations.
The approach is more efficient with fewer parameters.
Abstract
Learning invariant representations has been the long-standing approach to self-supervised learning. However, recently progress has been made in preserving equivariant properties in representations, yet do so with highly prescribed architectures. In this work, we propose an invariant-equivariant self-supervised architecture that employs Capsule Networks (CapsNets), which have been shown to capture equivariance with respect to novel viewpoints. We demonstrate that the use of CapsNets in equivariant self-supervised architectures achieves improved downstream performance on equivariant tasks with higher efficiency and fewer network parameters. To accommodate the architectural changes of CapsNets, we introduce a new objective function based on entropy minimisation. This approach, which we name CapsIE (Capsule Invariant Equivariant Network), achieves state-of-the-art performance on the…
Peer Reviews
Decision·ICLR 2025 Conference Withdrawn Submission
1. This is an intriguing work that empowers the inherent equivariance characteristics of capsule networks more effectively. 2. The significance of the proposed method has been demonstrated using more complex benchmarks compared to those traditionally used for validation in existing capsule network research.
1. The citation format is incorrect. Most citations should be within parentheses, but in all instances, they are presented without them. 2. While the methodology indeed enhances the equivariance of the original capsule network, there are still inherent limitations in the loss function that restrict its extension to real-world benchmarks. Real-world benchmarks often do not contain only a single (needed) equivariant factor; e.g., if there are five factors simultaneously, how would the loss functio
- Equivariant self-supervised learning is an interesting topic and is not systematically investigated in the literature. - The entropy minimization objective introduced in the paper is novel. It is an interesting approach for enhancing the representation learning capacity of Capsule Networks. - Promising initial results on the 3DIEBench dataset.
- My first main concern is that the proposed method relies heavily on Capsule Networks, which have been shown in prior research to be generally less robust and scalable compared to CNNs for a wide range of tasks (see [A]). - My second main concern is the limited scope of experiments. The experiments are confined to a single dataset (3DIEBench). This narrow experimental scope significantly limits the impact of the results and raises concerns about the practical applicability of CapsIE. - As men
- The proposed method is sound. it is very clear how the characteristics of the inner-workings and representations of capsule networks match the invariance/equivariance goals targeted by the proposed method. - The different design decisions made throughout the paper, in both the proposed method and conducted experiments, are clear and well motivated. - The proposed method outperforms existing methods in the benchmark related to the considered 3DIEBench dataset. - The paper has a g
- There is significant repetition between the contents of the Introduction (Sec.1) and Related Work (Sec.6) sections around the topics of equivariance and invariance. A revision/re-organization of this content may help improve the clarity/flow of the paper and help make additional room for extending the experimentation and/or discussion of already present experiments. Moreover, moving the description/introduction of Capsule Networks from Sec. 6.2. to an earlier section in the paper, would assist
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Neural Networks and Reservoir Computing
MethodsCapsule Network
