Learning Infinite RBMs with Frank-Wolfe
Wei Ping, Qiang Liu, Alexander Ihler

TL;DR
This paper introduces an infinite RBM model optimized via Frank-Wolfe, enabling adaptive complexity and improved training initialization, resulting in higher test likelihoods.
Contribution
It proposes a novel infinite RBM framework with Frank-Wolfe optimization, allowing automatic model complexity control and better initialization for training.
Findings
Sparse solutions with increasing hidden units during optimization
Efficient identification of the optimal number of hidden units
Higher test likelihoods compared to random initialization
Abstract
In this work, we propose an infinite restricted Boltzmann machine~(RBM), whose maximum likelihood estimation~(MLE) corresponds to a constrained convex optimization. We consider the Frank-Wolfe algorithm to solve the program, which provides a sparse solution that can be interpreted as inserting a hidden unit at each iteration, so that the optimization process takes the form of a sequence of finite models of increasing complexity. As a side benefit, this can be used to easily and efficiently identify an appropriate number of hidden units during the optimization. The resulting model can also be used as an initialization for typical state-of-the-art RBM training algorithms such as contrastive divergence, leading to models with consistently higher test likelihood than random initialization.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques
