Dense and Diverse Capsule Networks: Making the Capsules Learn Better

Sai Samarth R Phaye; Apoorva Sikka; Abhinav Dhall; Deepti Bathula

arXiv:1805.04001·cs.CV·May 11, 2018·45 cites

Dense and Diverse Capsule Networks: Making the Capsules Learn Better

Sai Samarth R Phaye, Apoorva Sikka, Abhinav Dhall, Deepti Bathula

PDF

Open Access 1 Repo

TL;DR

This paper introduces Dense and Diverse Capsule Networks, enhancing capsule learning with dense and hierarchical architectures, achieving state-of-the-art results on multiple image classification benchmarks with fewer training iterations.

Contribution

It proposes Dense and Diverse Capsule Networks that improve feature learning and efficiency over traditional CapsNet by using dense connections and hierarchical structures.

Findings

01

DCNet achieves 99.75% on MNIST with fewer training iterations.

02

DCNet++ outperforms CapsNet on SVHN and CIFAR-10 datasets.

03

Proposed architectures reduce training time and model complexity.

Abstract

Past few years have witnessed exponential growth of interest in deep learning methodologies with rapidly improving accuracies and reduced computational complexity. In particular, architectures using Convolutional Neural Networks (CNNs) have produced state-of-the-art performances for image classification and object recognition tasks. Recently, Capsule Networks (CapsNet) achieved significant increase in performance by addressing an inherent limitation of CNNs in encoding pose and deformation. Inspired by such advancement, we asked ourselves, can we do better? We propose Dense Capsule Networks (DCNet) and Diverse Capsule Networks (DCNet++). The two proposed frameworks customize the CapsNet by replacing the standard convolutional layers with densely connected convolutions. This helps in incorporating feature maps learned by different layers in forming the primary capsules. DCNet,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ssrp/Multi-level-DCNet
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · Multimodal Machine Learning Applications