Identifiability of Product of Experts Models
Spencer L. Gordon, Manav Kant, Eric Ma, Leonard J. Schulman, Andrei, Staicu

TL;DR
This paper establishes conditions under which product of experts models with binary latent and observable variables are identifiable, showing that the number of observables needed is linear in the number of parameters, improving previous bounds.
Contribution
It proves that PoE models with binary variables are identifiable with a linear number of observables, matching the best possible bounds, for both uniform and arbitrary latent distributions.
Findings
Identifiability with a number of observables equal to parameters for uniform latents.
Identifiability with a linear number of observables for arbitrary latents.
Use of root interlacing phenomena in proofs.
Abstract
Product of experts (PoE) are layered networks in which the value at each node is an AND (or product) of the values (possibly negated) at its inputs. These were introduced as a neural network architecture that can efficiently learn to generate high-dimensional data which satisfy many low-dimensional constraints -- thereby allowing each individual expert to perform a simple task. PoEs have found a variety of applications in learning. We study the problem of identifiability of a product of experts model having a layer of binary latent variables, and a layer of binary observables that are iid conditional on the latents. The previous best upper bound on the number of observables needed to identify the model was exponential in the number of parameters. We show: (a) When the latents are uniformly distributed, the model is identifiable with a number of observables equal to the number of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Neural Networks and Applications · Bayesian Modeling and Causal Inference
