Matrix and tensor decompositions for training binary neural networks
Adrian Bulat, Jean Kossaifi, Georgios Tzimiropoulos, Maja, Pantic

TL;DR
This paper introduces a novel approach to training binary neural networks by using matrix and tensor decompositions to improve accuracy while maintaining the benefits of binarization for model compression and inference speed.
Contribution
The method employs latent tensor decomposition for binarization, coupling filters before binarization, and learns scaling factors discriminatively, outperforming prior methods on pose estimation and image classification.
Findings
Achieves over 4% improvement in human pose estimation
Up to 5% performance gains on ImageNet classification
Maintains model compression and inference speed advantages
Abstract
This paper is on improving the training of binary neural networks in which both activations and weights are binary. While prior methods for neural network binarization binarize each filter independently, we propose to instead parametrize the weight tensor of each layer using matrix or tensor decomposition. The binarization process is then performed using this latent parametrization, via a quantization function (e.g. sign function) applied to the reconstructed weights. A key feature of our method is that while the reconstruction is binarized, the computation in the latent factorized space is done in the real domain. This has several advantages: (i) the latent factorization enforces a coupling of the filters before binarization, which significantly improves the accuracy of the trained models. (ii) while at training time, the binary weights of each convolutional layer are parametrized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · Tensor decomposition and applications
