Controlling Information Capacity of Binary Neural Network

Dmitry Ignatov; Andrey Ignatov

arXiv:2008.01438·cs.CV·August 5, 2020

Controlling Information Capacity of Binary Neural Network

Dmitry Ignatov, Andrey Ignatov

PDF

TL;DR

This paper introduces a Shannon entropy-based penalty method to train binary neural networks, maintaining their information capacity and significantly improving accuracy on multiple datasets.

Contribution

It presents a novel training approach that stabilizes information capacity in binary networks, reducing accuracy degradation.

Findings

01

Significant accuracy improvements on SVHN, CIFAR, and ImageNet datasets.

02

Maintains stable information capacity during training.

03

Outperforms existing binary network training methods.

Abstract

Despite the growing popularity of deep learning technologies, high memory requirements and power consumption are essentially limiting their application in mobile and IoT areas. While binary convolutional networks can alleviate these problems, the limited bitwidth of weights is often leading to significant degradation of prediction accuracy. In this paper, we present a method for training binary networks that maintains a stable predefined level of their information capacity throughout the training process by applying Shannon entropy based penalty to convolutional filters. The results of experiments conducted on SVHN, CIFAR and ImageNet datasets demonstrate that the proposed approach can statistically significantly improve the accuracy of binary networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.