GeoIB: Geometry-Aware Information Bottleneck via Statistical-Manifold Compression
Weiqi Wang, Zhiyi Tian, Chenhan Zhang, and Shui Yu

TL;DR
GeoIB introduces a geometry-aware approach to the Information Bottleneck problem that avoids mutual information estimation, leading to more stable optimization and better trade-offs between accuracy and compression.
Contribution
It proposes a novel geometric formulation of IB using information geometry, eliminating the need for MI estimation and improving stability and performance.
Findings
GeoIB achieves better accuracy-compression trade-off than baselines.
It enhances invariance and optimization stability.
The method unifies distributional and geometric regularization.
Abstract
Information Bottleneck (IB) is widely used, but in deep learning, it is usually implemented through tractable surrogates, such as variational bounds or neural mutual information (MI) estimators, rather than directly controlling the MI I(X;Z) itself. The looseness and estimator-dependent bias can make IB "compression" only indirectly controlled and optimization fragile. We revisit the IB problem through the lens of information geometry and propose a \textbf{Geo}metric \textbf{I}nformation \textbf{B}ottleneck (\textbf{GeoIB}) that dispenses with mutual information (MI) estimation. We show that I(X;Z) and I(Z;Y) admit exact projection forms as minimal Kullback-Leibler (KL) distances from the joint distributions to their respective independence manifolds. Guided by this view, GeoIB controls information compression with two complementary terms: (i) a distribution-level Fisher-Rao (FR)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis
