Gaussian Lower Bound for the Information Bottleneck Limit
Amichai Painsky, Naftali Tishby

TL;DR
This paper introduces a Gaussian lower bound for the Information Bottleneck (IB) curve, enabling efficient data embedding that approximates the IB trade-off and highlights the limits of linear methods in complex data representations.
Contribution
The work proposes a Gaussian lower bound for the IB curve and an embedding method that maximizes its Gaussian component, providing a practical approach to approximate the IB in arbitrary datasets.
Findings
The Gaussian IB (GIB) has a closed-form solution for Gaussian data.
The proposed embedding maximizes the Gaussian part of the data.
The optimal Gaussian embedding is bounded by non-linear CCA.
Abstract
The Information Bottleneck (IB) is a conceptual method for extracting the most compact, yet informative, representation of a set of variables, with respect to the target. It generalizes the notion of minimal sufficient statistics from classical parametric statistics to a broader information-theoretic sense. The IB curve defines the optimal trade-off between representation complexity and its predictive power. Specifically, it is achieved by minimizing the level of mutual information (MI) between the representation and the original variables, subject to a minimal level of MI between the representation and the target. This problem is shown to be in general NP hard. One important exception is the multivariate Gaussian case, for which the Gaussian IB (GIB) is known to obtain an analytical closed form solution, similar to Canonical Correlation Analysis (CCA). In this work we introduce a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Sensor Networks and Detection Algorithms · Age of Information Optimization
