Mutual information and task-relevant latent dimensionality
Paarth Gulati, Eslam Abdelaleem, Audrey Sederberg, Ilya Nemenman

TL;DR
This paper introduces a novel information bottleneck-based method to accurately estimate the task-relevant latent dimensionality in neural representations, addressing limitations of existing estimators and demonstrating robustness across synthetic and real datasets.
Contribution
It proposes a hybrid neural mutual information estimator with a one-shot protocol for effective dimensionality estimation, improving accuracy and robustness over traditional methods.
Findings
Standard neural estimators inflate dimension estimates
The hybrid critic preserves latent geometry and reduces bias
The method remains reliable in noisy regimes
Abstract
Estimating the dimensionality of the latent representation needed for prediction -- the task-relevant dimension -- is a difficult, largely unsolved problem with broad scientific applications. We cast it as an Information Bottleneck question: what embedding bottleneck dimension is sufficient to compress predictor and predicted views while preserving their mutual information (MI). This repurposes neural MI estimators for dimensionality estimation. We show that standard neural estimators with separable/bilinear critics systematically inflate the inferred dimension, and we address this by introducing a hybrid critic that retains an explicit dimensional bottleneck while allowing flexible nonlinear cross-view interactions, thereby preserving the latent geometry. We further propose a one-shot protocol that reads off the effective dimension from a single over-parameterized hybrid model, without…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis
