TL;DR
This paper introduces the redundancy bottleneck framework, a novel information bottleneck approach to quantify and analyze redundancy in information sources about a target, enabling efficient computation and source subset identification.
Contribution
It formulates partial information decomposition as an information bottleneck problem, providing a new method to quantify redundancy and identify redundant sources without combinatorial optimization.
Findings
Redundancy as an information bottleneck formalization.
Efficient iterative algorithm for RB curve computation.
Ability to identify redundant source subsets at multiple scales.
Abstract
The partial information decomposition (PID) aims to quantify the amount of redundant information that a set of sources provides about a target. Here, we show that this goal can be formulated as a type of information bottleneck (IB) problem, termed the "redundancy bottleneck" (RB). The RB formalizes a tradeoff between prediction and compression: it extracts information from the sources that best predict the target, without revealing which source provided the information. It can be understood as a generalization of "Blackwell redundancy", which we previously proposed as a principled measure of PID redundancy. The "RB curve" quantifies the prediction--compression tradeoff at multiple scales. This curve can also be quantified for individual sources, allowing subsets of redundant sources to be identified without combinatorial optimization. We provide an efficient iterative algorithm for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training
