Distributed Variational Representation Learning
Inaki Estella Aguerri, Abdellatif Zaidi

TL;DR
This paper extends the Information Bottleneck method to distributed settings, providing theoretical characterizations and practical algorithms for learning representations from multiple sources that preserve maximal information about a target variable.
Contribution
It introduces a theoretical framework for distributed information bottleneck, including explicit characterizations and variational bounds, along with algorithms for practical implementation.
Findings
Explicit optimal tradeoff characterizations for discrete and Gaussian models
Development of a variational bound generalizing ELBO for distributed data
Algorithms demonstrating efficiency on synthetic and real datasets
Abstract
The problem of distributed representation learning is one in which multiple sources of information are processed separately so as to learn as much information as possible about some ground truth . We investigate this problem from information-theoretic grounds, through a generalization of Tishby's centralized Information Bottleneck (IB) method to the distributed setting. Specifically, encoders, , compress their observations separately in a manner such that, collectively, the produced representations preserve as much information as possible about . We study both discrete memoryless (DM) and memoryless vector Gaussian data models. For the discrete model, we establish a single-letter characterization of the optimal tradeoff between complexity (or rate) and relevance (or information) for a class of memoryless sources (the observations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
