Optimize Deep Convolutional Neural Network with Ternarized Weights and   High Accuracy

Zhezhi He; Boqing Gong; Deliang Fan

arXiv:1807.07948·cs.CV·July 23, 2018

Optimize Deep Convolutional Neural Network with Ternarized Weights and High Accuracy

Zhezhi He, Boqing Gong, Deliang Fan

PDF

Open Access

TL;DR

This paper introduces a method for ternarizing deep CNN weights to significantly reduce model size and computation, while maintaining or improving accuracy on CIFAR-10 and ImageNet datasets.

Contribution

It proposes statistical weight scaling and residual expansion techniques for effective ternarization, achieving high compression rates with minimal accuracy loss or gains.

Findings

01

Achieves 16x model compression with slight accuracy improvements on CIFAR-10.

02

Outperforms recent methods on ImageNet with similar compression rates.

03

Further residual expansion improves top-5 accuracy with minimal top-1 degradation.

Abstract

Deep convolution neural network has achieved great success in many artificial intelligence applications. However, its enormous model size and massive computation cost have become the main obstacle for deployment of such powerful algorithm in the low power and resource-limited embedded systems. As the countermeasure to this problem, in this work, we propose statistical weight scaling and residual expansion methods to reduce the bit-width of the whole network weight parameters to ternary values (i.e. -1, 0, +1), with the objectives to greatly reduce model size, computation cost and accuracy degradation caused by the model compression. With about 16x model compression rate, our ternarized ResNet-32/44/56 could outperform full-precision counterparts by 0.12%, 0.24% and 0.18% on CIFAR- 10 dataset. We also test our ternarization method with AlexNet and ResNet-18 on ImageNet dataset, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

Methods1x1 Convolution · Local Response Normalization · Grouped Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Dense Connections · Max Pooling · Softmax · How do I speak to a person at Expedia?-/+/ · Convolution