On better training the infinite restricted Boltzmann machines

Xuan Peng; Xunzhang Gao; Xiang Li

arXiv:1709.03239·cs.LG·July 30, 2018

On better training the infinite restricted Boltzmann machines

Xuan Peng, Xunzhang Gao, Xiang Li

PDF

1 Repo

TL;DR

This paper introduces a novel training strategy for infinite restricted Boltzmann machines (iRBMs) that involves randomly regrouping hidden units to accelerate convergence and improve generalization, making iRBMs more practical.

Contribution

The paper proposes a new training method for iRBMs that reduces training time and enhances generalization by random permutation of hidden units during learning.

Findings

01

Training speed is significantly improved.

02

Model generalization is enhanced.

03

Effective on datasets like MNIST and CalTech101.

Abstract

The infinite restricted Boltzmann machine (iRBM) is an extension of the classic RBM. It enjoys a good property of automatically deciding the size of the hidden layer according to specific training data. With sufficient training, the iRBM can achieve a competitive performance with that of the classic RBM. However, the convergence of learning the iRBM is slow, due to the fact that the iRBM is sensitive to the ordering of its hidden units, the learned filters change slowly from the left-most hidden unit to right. To break this dependency between neighboring hidden units and speed up the convergence of training, a novel training strategy is proposed. The key idea of the proposed training strategy is randomly regrouping the hidden units before each gradient descent step. Potentially, a mixing of infinite many iRBMs with different permutations of the hidden units can be achieved by this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Boltzxuann/RP-iRBM
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Restricted Boltzmann Machine