The Convex Information Bottleneck Lagrangian

Borja Rodr\'iguez-G\'alvez; Ragnar Thobaben; Mikael Skoglund

arXiv:1911.11000·stat.ML·February 19, 2020

The Convex Information Bottleneck Lagrangian

Borja Rodr\'iguez-G\'alvez, Ragnar Thobaben, Mikael Skoglund

PDF

2 Repos

TL;DR

This paper introduces a family of Lagrangians for the information bottleneck problem that enables efficient exploration of the IB curve across all scenarios, simplifying the process of obtaining representations with desired predictability and compression levels.

Contribution

It presents a general family of Lagrangians for the IB problem, establishes a precise mapping between Lagrange multipliers and compression rates, and demonstrates approximate control over compression levels with a single optimization.

Findings

01

Unified Lagrangian family for all IB scenarios

02

Exact mapping between Lagrange multiplier and compression rate

03

Single optimization suffices to achieve desired compression

Abstract

The information bottleneck (IB) problem tackles the issue of obtaining relevant compressed representations $T$ of some random variable $X$ for the task of predicting $Y$ . It is defined as a constrained optimization problem which maximizes the information the representation has about the task, $I (T; Y)$ , while ensuring that a certain level of compression $r$ is achieved (i.e., $I (X; T) \leq r$ ). For practical reasons, the problem is usually solved by maximizing the IB Lagrangian (i.e., $L_{IB} (T; β) = I (T; Y) - β I (X; T)$ ) for many values of $β \in [0, 1]$ . Then, the curve of maximal $I (T; Y)$ for a given $I (X; T)$ is drawn and a representation with the desired predictability and compression is selected. It is known when $Y$ is a deterministic function of $X$ , the IB curve cannot be explored and another Lagrangian has been proposed to tackle this problem: the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.