IBCapsNet: Information Bottleneck Capsule Network for Noise-Robust Representation Learning

Canqun Xiang; Chen Yang; and Jiaoyan Zhao

arXiv:2603.20682·cs.CV·March 24, 2026

IBCapsNet: Information Bottleneck Capsule Network for Noise-Robust Representation Learning

Canqun Xiang, Chen Yang, and Jiaoyan Zhao

PDF

Open Access

TL;DR

IBCapsNet introduces an information bottleneck-based capsule architecture that achieves noise robustness and efficiency improvements over traditional capsule networks by replacing iterative routing with a variational aggregation mechanism.

Contribution

The paper proposes IBCapsNet, a novel capsule network using the IB principle and variational autoencoders, reducing computational cost and improving robustness against input noise.

Findings

01

Matches CapsNet accuracy on clean data

02

Significantly outperforms CapsNet under noise

03

Faster training and inference with fewer parameters

Abstract

Capsule networks (CapsNets) are superior at modeling hierarchical spatial relationships but suffer from two critical limitations: high computational cost due to iterative dynamic routing and poor robustness under input corruptions. To address these issues, we propose IBCapsNet, a novel capsule architecture grounded in the Information Bottleneck (IB) principle. Instead of iterative routing, IBCapsNet employs a one-pass variational aggregation mechanism, where primary capsules are first compressed into a global context representation and then processed by class-specific variational autoencoders (VAEs) to infer latent capsules regularized by the KL divergence. This design enables efficient inference while inherently filtering out noise. Experiments on MNIST, Fashion-MNIST, SVHN and CIFAR-10 show that IBCapsNet matches CapsNet in clean-data accuracy (achieving 99.41% on MNIST and 92.01% on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning