IBCapsNet: Information Bottleneck Capsule Network for Noise-Robust Representation Learning
Canqun Xiang, Chen Yang, and Jiaoyan Zhao

TL;DR
IBCapsNet introduces an information bottleneck-based capsule architecture that achieves noise robustness and efficiency improvements over traditional capsule networks by replacing iterative routing with a variational aggregation mechanism.
Contribution
The paper proposes IBCapsNet, a novel capsule network using the IB principle and variational autoencoders, reducing computational cost and improving robustness against input noise.
Findings
Matches CapsNet accuracy on clean data
Significantly outperforms CapsNet under noise
Faster training and inference with fewer parameters
Abstract
Capsule networks (CapsNets) are superior at modeling hierarchical spatial relationships but suffer from two critical limitations: high computational cost due to iterative dynamic routing and poor robustness under input corruptions. To address these issues, we propose IBCapsNet, a novel capsule architecture grounded in the Information Bottleneck (IB) principle. Instead of iterative routing, IBCapsNet employs a one-pass variational aggregation mechanism, where primary capsules are first compressed into a global context representation and then processed by class-specific variational autoencoders (VAEs) to infer latent capsules regularized by the KL divergence. This design enables efficient inference while inherently filtering out noise. Experiments on MNIST, Fashion-MNIST, SVHN and CIFAR-10 show that IBCapsNet matches CapsNet in clean-data accuracy (achieving 99.41% on MNIST and 92.01% on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
