Dense and Diverse Capsule Networks: Making the Capsules Learn Better
Sai Samarth R Phaye, Apoorva Sikka, Abhinav Dhall, Deepti Bathula

TL;DR
This paper introduces Dense and Diverse Capsule Networks, enhancing capsule learning with dense and hierarchical architectures, achieving state-of-the-art results on multiple image classification benchmarks with fewer training iterations.
Contribution
It proposes Dense and Diverse Capsule Networks that improve feature learning and efficiency over traditional CapsNet by using dense connections and hierarchical structures.
Findings
DCNet achieves 99.75% on MNIST with fewer training iterations.
DCNet++ outperforms CapsNet on SVHN and CIFAR-10 datasets.
Proposed architectures reduce training time and model complexity.
Abstract
Past few years have witnessed exponential growth of interest in deep learning methodologies with rapidly improving accuracies and reduced computational complexity. In particular, architectures using Convolutional Neural Networks (CNNs) have produced state-of-the-art performances for image classification and object recognition tasks. Recently, Capsule Networks (CapsNet) achieved significant increase in performance by addressing an inherent limitation of CNNs in encoding pose and deformation. Inspired by such advancement, we asked ourselves, can we do better? We propose Dense Capsule Networks (DCNet) and Diverse Capsule Networks (DCNet++). The two proposed frameworks customize the CapsNet by replacing the standard convolutional layers with densely connected convolutions. This helps in incorporating feature maps learned by different layers in forming the primary capsules. DCNet,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · Multimodal Machine Learning Applications
