TL;DR
This paper introduces a family of Lagrangians for the information bottleneck problem that enables efficient exploration of the IB curve across all scenarios, simplifying the process of obtaining representations with desired predictability and compression levels.
Contribution
It presents a general family of Lagrangians for the IB problem, establishes a precise mapping between Lagrange multipliers and compression rates, and demonstrates approximate control over compression levels with a single optimization.
Findings
Unified Lagrangian family for all IB scenarios
Exact mapping between Lagrange multiplier and compression rate
Single optimization suffices to achieve desired compression
Abstract
The information bottleneck (IB) problem tackles the issue of obtaining relevant compressed representations of some random variable for the task of predicting . It is defined as a constrained optimization problem which maximizes the information the representation has about the task, , while ensuring that a certain level of compression is achieved (i.e., ). For practical reasons, the problem is usually solved by maximizing the IB Lagrangian (i.e., ) for many values of . Then, the curve of maximal for a given is drawn and a representation with the desired predictability and compression is selected. It is known when is a deterministic function of , the IB curve cannot be explored and another Lagrangian has been proposed to tackle this problem: the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
