Mutual information and task-relevant latent dimensionality

Paarth Gulati; Eslam Abdelaleem; Audrey Sederberg; Ilya Nemenman

arXiv:2602.08105·cs.LG·February 10, 2026

Mutual information and task-relevant latent dimensionality

Paarth Gulati, Eslam Abdelaleem, Audrey Sederberg, Ilya Nemenman

PDF

Open Access

TL;DR

This paper introduces a novel information bottleneck-based method to accurately estimate the task-relevant latent dimensionality in neural representations, addressing limitations of existing estimators and demonstrating robustness across synthetic and real datasets.

Contribution

It proposes a hybrid neural mutual information estimator with a one-shot protocol for effective dimensionality estimation, improving accuracy and robustness over traditional methods.

Findings

01

Standard neural estimators inflate dimension estimates

02

The hybrid critic preserves latent geometry and reduces bias

03

The method remains reliable in noisy regimes

Abstract

Estimating the dimensionality of the latent representation needed for prediction -- the task-relevant dimension -- is a difficult, largely unsolved problem with broad scientific applications. We cast it as an Information Bottleneck question: what embedding bottleneck dimension is sufficient to compress predictor and predicted views while preserving their mutual information (MI). This repurposes neural MI estimators for dimensionality estimation. We show that standard neural estimators with separable/bilinear critics systematically inflate the inferred dimension, and we address this by introducing a hybrid critic that retains an explicit dimensional bottleneck while allowing flexible nonlinear cross-view interactions, thereby preserving the latent geometry. We further propose a one-shot protocol that reads off the effective dimension from a single over-parameterized hybrid model, without…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis