Belief Propagation in Conditional RBMs for Structured Prediction

Wei Ping; Alexander Ihler

arXiv:1703.00986 · cs.LG · March 6, 2017 · 2 cites


Open Access

TL;DR

This paper introduces a scalable, matrix-based implementation of belief propagation (BP) for conditional RBMs (CRBMs) and shows that training with BP as the inference routine outperforms contrastive divergence (CD) on structured prediction tasks.

Contribution

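The paper presents a matrix-based belief propagation implementation for CRBMs that scales to tens of thousands of visible and hidden units. Used as the inference routine during training, it yields better structured prediction results than state-of-the-art contrastive divergence methods.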

Findings

01. BP outperforms CD in structured prediction accuracy

02. Scalable implementation handles tens of thousands of visible and hidden units

03. Improved training results in both maximum likelihood and max-margin learning

Abstract

Restricted Boltzmann machines (RBMs) and conditional RBMs (CRBMs) are popular models for a wide range of applications. In previous work, learning on such models has been dominated by contrastive divergence (CD) and its variants. Belief propagation (BP) algorithms are believed to be slow for structured prediction on conditional RBMs (e.g., Mnih et al. [2011]), and not as good as CD when applied in learning (e.g., Larochelle et al. [2012]). In this work, we present a matrix-based implementation of belief propagation algorithms on CRBMs, which is easily scalable to tens of thousands of visible and hidden units. We demonstrate that, in both maximum likelihood and max-margin learning, training conditional RBMs with BP as the inference routine can provide significantly better results than current state-of-the-art CD methods on structured prediction problems. We also include practical…
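To make the idea of matrix-based BP concrete, here is a minimal NumPy sketch of sum-product loopy BP on a binary RBM with energy -(aᵀv + bᵀh + vᵀWh). It is an illustration written for this summary, not the authors' implementation: the function names (`rbm_bp_marginals`, `exact_marginals`), the log-odds message parameterization, the parallel update schedule, and the `damping` parameter are all assumptions. For a conditional RBM, the sketch assumes the dependence on the input has already been folded into the bias vectors `a` and `b`.

```python
import itertools
import numpy as np


def softplus(x):
    # Numerically stable log(1 + exp(x)).
    return np.logaddexp(0.0, x)


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def rbm_bp_marginals(W, a, b, n_iters=50, damping=0.0):
    """Sum-product loopy BP on a binary RBM, with all messages held in matrices.

    W: (n, m) pairwise weights; a: (n,) visible biases; b: (m,) hidden biases.
    Messages are stored as log-odds (assumed parameterization):
      mu[i, j]  = message from hidden j to visible i
      lam[i, j] = message from visible i to hidden j
    Returns approximate marginals P(v_i = 1) and P(h_j = 1).
    """
    n, m = W.shape
    mu = np.zeros((n, m))
    lam = np.zeros((n, m))
    for _ in range(n_iters):
        # Cavity field at visible i, excluding the message coming from hidden j.
        cav_v = a[:, None] + mu.sum(axis=1, keepdims=True) - mu
        new_lam = softplus(cav_v + W) - softplus(cav_v)
        lam = damping * lam + (1.0 - damping) * new_lam

        # Cavity field at hidden j, excluding the message coming from visible i.
        cav_h = b[None, :] + lam.sum(axis=0, keepdims=True) - lam
        new_mu = softplus(cav_h + W) - softplus(cav_h)
        mu = damping * mu + (1.0 - damping) * new_mu

    # Beliefs combine the local bias with all incoming messages.
    p_v = sigmoid(a + mu.sum(axis=1))
    p_h = sigmoid(b + lam.sum(axis=0))
    return p_v, p_h


def exact_marginals(W, a, b):
    """Brute-force marginals by enumeration; only feasible for tiny models."""
    n, m = W.shape
    states = np.array(list(itertools.product([0, 1], repeat=n + m)), dtype=float)
    V, H = states[:, :n], states[:, n:]
    logp = V @ a + H @ b + np.einsum("si,ij,sj->s", V, W, H)
    p = np.exp(logp - logp.max())
    p /= p.sum()
    return p @ V, p @ H
```

Every update is an elementwise operation or a row/column sum over an n × m matrix, which is what lets this style of BP run on large models with standard array libraries. On a tree-structured model (for example, a single hidden unit) the beliefs match `exact_marginals`; on general RBMs they are approximations, and a nonzero `damping` may help convergence.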


Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Neural Networks and Applications