Neural Bayes: A Generic Parameterization Method for Unsupervised Representation Learning
Devansh Arpit, Huan Wang, Caiming Xiong, Richard Socher, Yoshua Bengio

TL;DR
Neural Bayes introduces a flexible parameterization technique enabling closed-form computation of complex statistical quantities, facilitating new objectives in unsupervised representation learning such as mutual information maximization and manifold clustering.
Contribution
It provides a novel neural network-based parameterization method that allows exact computation of distributions and mutual information, advancing unsupervised learning and clustering methods.
Findings
Closed-form mutual information computation improves representation learning.
Effective disjoint manifold clustering demonstrates the method's practical utility.
The approach enables new unsupervised learning objectives with theoretical guarantees.
Abstract
We introduce a parameterization method called Neural Bayes which allows computing statistical quantities that are in general difficult to compute and opens avenues for formulating new objectives for unsupervised representation learning. Specifically, given an observed random variable and a latent discrete variable , we can express , and in closed form in terms of a sufficiently expressive function (Eg. neural network) using our parameterization without restricting the class of these distributions. To demonstrate its usefulness, we develop two independent use cases for this parameterization: 1. Mutual Information Maximization (MIM): MIM has become a popular means for self-supervised representation learning. Neural Bayes allows us to compute mutual information between observed random variables and latent discrete…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Neural Networks and Applications · Human Pose and Action Recognition
MethodsMutual Information Machine/Mask Image Modeling
