EquiCaps: Predictor-Free Pose-Aware Pre-Trained Capsule Networks

Athinoulla Konstantinou; Georgios Leontidis; Mamatha Thota; and Aiden Durrant

arXiv:2506.09895·cs.CV·June 12, 2025

EquiCaps: Predictor-Free Pose-Aware Pre-Trained Capsule Networks

Athinoulla Konstantinou, Georgios Leontidis, Mamatha Thota, and Aiden Durrant

PDF

Open Access 1 Repo 1 Datasets

TL;DR

EquiCaps introduces a capsule-based, predictor-free approach for learning pose-aware representations that inherently handle transformations, outperforming prior methods in rotation prediction and demonstrating strong generalisation under complex geometric transformations.

Contribution

The paper presents EquiCaps, a novel capsule network architecture that eliminates the need for predictor modules to enforce equivariance, leveraging capsules' intrinsic pose-awareness for improved self-supervised learning.

Findings

01

Outperforms state-of-the-art equivariant methods on rotation prediction with R^2 of 0.78.

02

Maintains robust equivariance under combined geometric transformations.

03

Demonstrates superior generalisation capabilities in complex transformation scenarios.

Abstract

Learning self-supervised representations that are invariant and equivariant to transformations is crucial for advancing beyond traditional visual classification tasks. However, many methods rely on predictor architectures to encode equivariance, despite evidence that architectural choices, such as capsule networks, inherently excel at learning interpretable pose-aware representations. To explore this, we introduce EquiCaps (Equivariant Capsule Network), a capsule-based approach to pose-aware self-supervision that eliminates the need for a specialised predictor for enforcing equivariance. Instead, we leverage the intrinsic pose-awareness capabilities of capsules to improve performance in pose estimation tasks. To further challenge our assumptions, we increase task complexity via multi-geometric transformations to enable a more thorough evaluation of invariance and equivariance by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aberdeenml/equicaps
pytorchOfficial

Datasets

athenacon/3DIEBench-T
dataset· 22 dl
22 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Robot Manipulation and Learning · Human Pose and Action Recognition