Conditional Restricted Boltzmann Machines for Structured Output Prediction
Volodymyr Mnih, Hugo Larochelle, Geoffrey E. Hinton

TL;DR
This paper introduces improved training algorithms for Conditional Restricted Boltzmann Machines (CRBMs) tailored for structured output prediction tasks, addressing limitations of traditional methods and demonstrating superior performance.
Contribution
The paper proposes novel learning algorithms for CRBMs that outperform Contrastive Divergence in structured output prediction problems with different output space complexities.
Findings
New algorithms outperform Contrastive Divergence in experiments.
Effective for both small and large structured output spaces.
Applicable to tasks like multi-label classification and image denoising.
Abstract
Conditional Restricted Boltzmann Machines (CRBMs) are rich probabilistic models that have recently been applied to a wide range of problems, including collaborative filtering, classification, and modeling motion capture data. While much progress has been made in training non-conditional RBMs, these algorithms are not applicable to conditional models and there has been almost no work on training and generating predictions from conditional RBMs for structured output problems. We first argue that standard Contrastive Divergence-based learning may not be suitable for training CRBMs. We then identify two distinct types of structured output prediction problems and propose an improved learning algorithm for each. The first problem type is one where the output space has arbitrary structure but the set of likely output configurations is relatively small, such as in multi-label classification.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Music and Audio Processing · Domain Adaptation and Few-Shot Learning
