A Max-Sum algorithm for training discrete neural networks

Carlo Baldassi; Alfredo Braunstein

arXiv:1505.05401·cond-mat.dis-nn·August 14, 2015

A Max-Sum algorithm for training discrete neural networks

Carlo Baldassi, Alfredo Braunstein

PDF

TL;DR

This paper introduces an efficient Max-Sum algorithm for training discrete neural networks, achieving near state-of-the-art complexity and performance, especially for binary and ternary synapses, without approximations.

Contribution

It develops a scalable Max-Sum algorithm for discrete neural network training, outperforming traditional methods in certain settings and handling symmetries in two-layer networks.

Findings

01

Algorithm scales as O(N log N) per node update.

02

Performs as well as Belief Propagation on binary perceptrons.

03

Potentially better suited for fully-connected two-layer networks.

Abstract

We present an efficient learning algorithm for the problem of training neural networks with discrete synapses, a well-known hard (NP-complete) discrete optimization problem. The algorithm is a variant of the so-called Max-Sum (MS) algorithm. In particular, we show how, for bounded integer weights with $q$ distinct states and independent concave a priori distribution (e.g. $l_{1}$ regularization), the algorithm's time complexity can be made to scale as $O (N lo g N)$ per node update, thus putting it on par with alternative schemes, such as Belief Propagation (BP), without resorting to approximations. Two special cases are of particular interest: binary synapses $W \in {- 1, 1}$ and ternary synapses $W \in {- 1, 0, 1}$ with $l_{0}$ regularization. The algorithm we present performs as well as BP on binary perceptron learning problems, and may be better suited to address the problem on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.