Hierarchical Object-Centric Learning with Capsule Networks
Riccardo Renzulli

TL;DR
This paper advances capsule networks by improving routing, optimizing primary capsules, and learning part-relationships, demonstrating their effectiveness in real-world vision tasks like UAV localization and medical imaging.
Contribution
It introduces novel methods for routing annealing, efficient primary capsule extraction, and part-relationship learning, enhancing CapsNet performance and interpretability.
Findings
Routing annealing improves small network performance.
Pruned backbones reduce memory and computation.
Low-entropy capsules better capture part-whole relationships.
Abstract
Capsule networks (CapsNets) were introduced to address convolutional neural networks limitations, learning object-centric representations that are more robust, pose-aware, and interpretable. They organize neurons into groups called capsules, where each capsule encodes the instantiation parameters of an object or one of its parts. Moreover, a routing algorithm connects capsules in different layers, thereby capturing hierarchical part-whole relationships in the data. This thesis investigates the intriguing aspects of CapsNets and focuses on three key questions to unlock their full potential. First, we explore the effectiveness of the routing algorithm, particularly in small-sized networks. We propose a novel method that anneals the number of routing iterations during training, enhancing performance in architectures with fewer parameters. Secondly, we investigate methods to extract…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies
