Deep multi-prototype capsule networks
Saeid Abbassi, Kamaledin Ghiasi-Shirazi, Ahad Harati

TL;DR
This paper introduces a multi-prototype capsule network architecture that captures diverse image part variations, improves trainability, and enhances classification accuracy across multiple datasets.
Contribution
It proposes a multi-prototype capsule network with co-group capsules and shared weights, enabling deeper architectures and better handling of intra-class variations.
Findings
Outperforms existing models on MNIST, SVHN, and other datasets.
Allows deeper capsule networks due to parameter reduction.
Achieves higher classification accuracy.
Abstract
Capsule networks are a type of neural network that identify image parts and form the instantiation parameters of a whole hierarchically. The goal behind the network is to perform an inverse computer graphics task, and the network parameters are the mapping weights that transform parts into a whole. The trainability of capsule networks in complex data with high intra-class or intra-part variation is challenging. This paper presents a multi-prototype architecture for guiding capsule networks to represent the variations in the image parts. To this end, instead of considering a single capsule for each class and part, the proposed method employs several capsules (co-group capsules), capturing multiple prototypes of an object. In the final layer, co-group capsules compete, and their soft output is considered the target for a competitive cross-entropy loss. Moreover, in the middle layers, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicle License Plate Recognition
MethodsCapsule Network
