Learning Gaussian-Bernoulli RBMs using Difference of Convex Functions Optimization
Vidyadhar Upadhya, P S Sastry

TL;DR
This paper introduces a novel stochastic difference of convex functions (S-DCP) algorithm for learning Gaussian-Bernoulli RBMs, improving training speed and model quality over traditional contrastive divergence methods.
Contribution
The paper proposes a new S-DCP algorithm that reformulates the negative log-likelihood as a difference of convex functions, enabling more efficient training of GB-RBMs.
Findings
S-DCP outperforms CD and PCD in training speed.
S-DCP produces higher quality generative models.
Empirical validation on benchmark datasets confirms effectiveness.
Abstract
The Gaussian-Bernoulli restricted Boltzmann machine (GB-RBM) is a useful generative model that captures meaningful features from the given -dimensional continuous data. The difficulties associated with learning GB-RBM are reported extensively in earlier studies. They indicate that the training of the GB-RBM using the current standard algorithms, namely, contrastive divergence (CD) and persistent contrastive divergence (PCD), needs a carefully chosen small learning rate to avoid divergence which, in turn, results in slow learning. In this work, we alleviate such difficulties by showing that the negative log-likelihood for a GB-RBM can be expressed as a difference of convex functions if we keep the variance of the conditional distribution of visible units (given hidden unit states) and the biases of the visible units, constant. Using this, we propose a stochastic {\em difference of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Domain Adaptation and Few-Shot Learning
MethodsRestricted Boltzmann Machine
