Hierarchical Object-Centric Learning with Capsule Networks

Riccardo Renzulli

arXiv:2405.19861·cs.CV·May 31, 2024

Hierarchical Object-Centric Learning with Capsule Networks

Riccardo Renzulli

PDF

Open Access

TL;DR

This paper advances capsule networks by improving routing, optimizing primary capsules, and learning part-relationships, demonstrating their effectiveness in real-world vision tasks like UAV localization and medical imaging.

Contribution

It introduces novel methods for routing annealing, efficient primary capsule extraction, and part-relationship learning, enhancing CapsNet performance and interpretability.

Findings

01

Routing annealing improves small network performance.

02

Pruned backbones reduce memory and computation.

03

Low-entropy capsules better capture part-whole relationships.

Abstract

Capsule networks (CapsNets) were introduced to address convolutional neural networks limitations, learning object-centric representations that are more robust, pose-aware, and interpretable. They organize neurons into groups called capsules, where each capsule encodes the instantiation parameters of an object or one of its parts. Moreover, a routing algorithm connects capsules in different layers, thereby capturing hierarchical part-whole relationships in the data. This thesis investigates the intriguing aspects of CapsNets and focuses on three key questions to unlock their full potential. First, we explore the effectiveness of the routing algorithm, particularly in small-sized networks. We propose a novel method that anneals the number of routing iterations during training, enhancing performance in architectures with fewer parameters. Secondly, we investigate methods to extract…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies