Sparse Communication via Mixed Distributions
Ant\'onio Farinhas, Wilker Aziz, Vlad Niculae, Andr\'e F. T., Martins

TL;DR
This paper develops a rigorous theoretical framework for hybrid discrete-continuous random variables, called mixed random variables, enabling better modeling and sampling of mixed representations in neural networks.
Contribution
It introduces a new measure and divergence functions for mixed variables, and proposes two strategies for their representation and sampling, advancing the understanding of hybrid discrete-continuous models.
Findings
Effective modeling of mixed latent variables in VAEs.
Successful experiments on emergent communication and image datasets.
Theoretical foundations for hybrid discrete-continuous distributions.
Abstract
Neural networks and other machine learning models compute continuous representations, while humans communicate mostly through discrete symbols. Reconciling these two forms of communication is desirable for generating human-readable interpretations or learning discrete latent variable models, while maintaining end-to-end differentiability. Some existing approaches (such as the Gumbel-Softmax transformation) build continuous relaxations that are discrete approximations in the zero-temperature limit, while others (such as sparsemax transformations and the Hard Concrete distribution) produce discrete/continuous hybrids. In this paper, we build rigorous theoretical foundations for these hybrids, which we call "mixed random variables." Our starting point is a new "direct sum" base measure defined on the face lattice of the probability simplex. From this measure, we introduce new entropy and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
MethodsSparsemax
