Accelerating Deep Learning with Memcomputing

Haik Manukian; Fabio L. Traversa; Massimiliano Di Ventra

arXiv:1801.00512·cs.LG·October 25, 2018

Accelerating Deep Learning with Memcomputing

Haik Manukian, Fabio L. Traversa, Massimiliano Di Ventra

PDF

TL;DR

This paper introduces a memcomputing-based method to accelerate training of deep belief networks, achieving faster convergence and higher accuracy than traditional contrastive divergence and quantum annealing methods.

Contribution

The authors demonstrate that digital memcomputing machines can efficiently sample the model distribution in RBMs, significantly reducing training iterations and improving performance over existing methods.

Findings

01

DMMs provide near-optimal sampling of RBM distributions.

02

The method reduces pretraining iterations and improves accuracy.

03

Performance surpasses quantum annealing and other supervised training improvements.

Abstract

Restricted Boltzmann machines (RBMs) and their extensions, called 'deep-belief networks', are powerful neural networks that have found applications in the fields of machine learning and artificial intelligence. The standard way to training these models resorts to an iterative unsupervised procedure based on Gibbs sampling, called 'contrastive divergence' (CD), and additional supervised tuning via back-propagation. However, this procedure has been shown not to follow any gradient and can lead to suboptimal solutions. In this paper, we show an efficient alternative to CD by means of simulations of digital memcomputing machines (DMMs). We test our approach on pattern recognition using a modified version of the MNIST data set. DMMs sample effectively the vast phase space given by the model distribution of the RBM, and provide a very good approximation close to the optimum. This efficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.