Statistically Valid Information Bottleneck via Multiple Hypothesis   Testing

Amirmohammad Farzaneh; Osvaldo Simeone

arXiv:2409.07325·cs.IT·October 11, 2024

Statistically Valid Information Bottleneck via Multiple Hypothesis Testing

Amirmohammad Farzaneh, Osvaldo Simeone

PDF

Open Access

TL;DR

This paper introduces IB-MHT, a statistically valid method for solving the information bottleneck problem that guarantees IB constraints with high probability, improving robustness over heuristic approaches.

Contribution

It presents a novel IB solution using multiple hypothesis testing that provides statistical guarantees, applicable to classical and deterministic IB problems.

Findings

01

IB-MHT outperforms traditional methods in robustness

02

Provides high-probability guarantees on IB constraints

03

Effective in language model distillation

Abstract

The information bottleneck (IB) problem is a widely studied framework in machine learning for extracting compressed features that are informative for downstream tasks. However, current approaches to solving the IB problem rely on a heuristic tuning of hyperparameters, offering no guarantees that the learned features satisfy information-theoretic constraints. In this work, we introduce a statistically valid solution to this problem, referred to as IB via multiple hypothesis testing (IB-MHT), which ensures that the learned features meet the IB constraints with high probability, regardless of the size of the available dataset. The proposed methodology builds on Pareto testing and learn-then-test (LTT), and it wraps around existing IB solvers to provide statistical guarantees on the IB constraints. We demonstrate the performance of IB-MHT on classical and deterministic IB formulations,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification