Capsule Network Projectors are Equivariant and Invariant Learners

Miles Everett; Aiden Durrant; Mingjun Zhong; and Georgios Leontidis

arXiv:2405.14386·cs.CV·October 7, 2025·1 cites

Capsule Network Projectors are Equivariant and Invariant Learners

Miles Everett, Aiden Durrant, Mingjun Zhong, and Georgios Leontidis

PDF

Open Access 3 Reviews

TL;DR

This paper introduces CapsIE, a novel self-supervised architecture using Capsule Networks to learn invariant and equivariant representations, achieving state-of-the-art results on 3D rotation tasks with improved efficiency.

Contribution

The work presents a new CapsNet-based architecture with a specialized entropy minimization objective for invariant-equivariant learning, outperforming prior methods on equivariant tasks.

Findings

01

CapsIE achieves state-of-the-art on 3DIEBench rotation tasks.

02

CapsNets effectively learn complex, generalised representations.

03

The approach is more efficient with fewer parameters.

Abstract

Learning invariant representations has been the long-standing approach to self-supervised learning. However, recently progress has been made in preserving equivariant properties in representations, yet do so with highly prescribed architectures. In this work, we propose an invariant-equivariant self-supervised architecture that employs Capsule Networks (CapsNets), which have been shown to capture equivariance with respect to novel viewpoints. We demonstrate that the use of CapsNets in equivariant self-supervised architectures achieves improved downstream performance on equivariant tasks with higher efficiency and fewer network parameters. To accommodate the architectural changes of CapsNets, we introduce a new objective function based on entropy minimisation. This approach, which we name CapsIE (Capsule Invariant Equivariant Network), achieves state-of-the-art performance on the…

Peer Reviews

Decision·ICLR 2025 Conference Withdrawn Submission

Reviewer 01Rating 5Confidence 4

Strengths

1. This is an intriguing work that empowers the inherent equivariance characteristics of capsule networks more effectively. 2. The significance of the proposed method has been demonstrated using more complex benchmarks compared to those traditionally used for validation in existing capsule network research.

Weaknesses

1. The citation format is incorrect. Most citations should be within parentheses, but in all instances, they are presented without them. 2. While the methodology indeed enhances the equivariance of the original capsule network, there are still inherent limitations in the loss function that restrict its extension to real-world benchmarks. Real-world benchmarks often do not contain only a single (needed) equivariant factor; e.g., if there are five factors simultaneously, how would the loss functio

Reviewer 02Rating 3Confidence 4

Strengths

- Equivariant self-supervised learning is an interesting topic and is not systematically investigated in the literature. - The entropy minimization objective introduced in the paper is novel. It is an interesting approach for enhancing the representation learning capacity of Capsule Networks. - Promising initial results on the 3DIEBench dataset.

Weaknesses

- My first main concern is that the proposed method relies heavily on Capsule Networks, which have been shown in prior research to be generally less robust and scalable compared to CNNs for a wide range of tasks (see [A]). - My second main concern is the limited scope of experiments. The experiments are confined to a single dataset (3DIEBench). This narrow experimental scope significantly limits the impact of the results and raises concerns about the practical applicability of CapsIE. - As men

Reviewer 03Rating 5Confidence 4

Strengths

- The proposed method is sound. it is very clear how the characteristics of the inner-workings and representations of capsule networks match the invariance/equivariance goals targeted by the proposed method. - The different design decisions made throughout the paper, in both the proposed method and conducted experiments, are clear and well motivated. - The proposed method outperforms existing methods in the benchmark related to the considered 3DIEBench dataset. - The paper has a g

Weaknesses

- There is significant repetition between the contents of the Introduction (Sec.1) and Related Work (Sec.6) sections around the topics of equivariance and invariance. A revision/re-organization of this content may help improve the clarity/flow of the paper and help make additional room for extending the experimentation and/or discussion of already present experiments. Moreover, moving the description/introduction of Capsule Networks from Sec. 6.2. to an earlier section in the paper, would assist

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Neural Networks and Reservoir Computing

MethodsCapsule Network