Belief Propagation in Conditional RBMs for Structured Prediction
Wei Ping, Alexander Ihler

TL;DR
This paper introduces a scalable matrix-based belief propagation method for conditional RBMs, demonstrating superior performance over contrastive divergence in structured prediction tasks.
Contribution
It presents a novel, scalable belief propagation implementation for CRBMs, improving inference efficiency and prediction accuracy over traditional contrastive divergence methods.
Findings
BP outperforms CD in structured prediction accuracy
Scalable implementation handles tens of thousands of units
Improved training results in maximum likelihood and max-margin learning
Abstract
Restricted Boltzmann machines~(RBMs) and conditional RBMs~(CRBMs) are popular models for a wide range of applications. In previous work, learning on such models has been dominated by contrastive divergence~(CD) and its variants. Belief propagation~(BP) algorithms are believed to be slow for structured prediction on conditional RBMs~(e.g., Mnih et al. [2011]), and not as good as CD when applied in learning~(e.g., Larochelle et al. [2012]). In this work, we present a matrix-based implementation of belief propagation algorithms on CRBMs, which is easily scalable to tens of thousands of visible and hidden units. We demonstrate that, in both maximum likelihood and max-margin learning, training conditional RBMs with BP as the inference routine can provide significantly better results than current state-of-the-art CD methods on structured prediction problems. We also include practical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Neural Networks and Applications
