Relevant sparse codes with variational information bottleneck
Matthew Chalk, Olivier Marre, Gasper Tkacik

TL;DR
This paper introduces a variational information bottleneck approach to extract relevant, sparse features from high-dimensional data, with kernelized extensions for non-linear relationships, improving computational efficiency.
Contribution
It proposes a novel variational scheme for the information bottleneck, enabling sparse feature extraction and kernelized non-linear modeling.
Findings
Developed an approximate IB algorithm for relevance and sparsity
Derived kernelized versions for non-linear data
Demonstrated effectiveness on complex datasets
Abstract
In many applications, it is desirable to extract only the relevant aspects of data. A principled way to do this is the information bottleneck (IB) method, where one seeks a code that maximizes information about a 'relevance' variable, Y, while constraining the information encoded about the original data, X. Unfortunately however, the IB method is computationally demanding when data are high-dimensional and/or non-gaussian. Here we propose an approximate variational scheme for maximizing a lower bound on the IB objective, analogous to variational EM. Using this method, we derive an IB algorithm to recover features that are both relevant and sparse. Finally, we demonstrate how kernelized versions of the algorithm can be used to address a broad range of problems with non-linear relation between X and Y.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlind Source Separation Techniques · Sparse and Compressive Sensing Techniques · Advanced Vision and Imaging
