Optimize Deep Convolutional Neural Network with Ternarized Weights and High Accuracy
Zhezhi He, Boqing Gong, Deliang Fan

TL;DR
This paper introduces a method for ternarizing deep CNN weights to significantly reduce model size and computation, while maintaining or improving accuracy on CIFAR-10 and ImageNet datasets.
Contribution
It proposes statistical weight scaling and residual expansion techniques for effective ternarization, achieving high compression rates with minimal accuracy loss or gains.
Findings
Achieves 16x model compression with slight accuracy improvements on CIFAR-10.
Outperforms recent methods on ImageNet with similar compression rates.
Further residual expansion improves top-5 accuracy with minimal top-1 degradation.
Abstract
Deep convolution neural network has achieved great success in many artificial intelligence applications. However, its enormous model size and massive computation cost have become the main obstacle for deployment of such powerful algorithm in the low power and resource-limited embedded systems. As the countermeasure to this problem, in this work, we propose statistical weight scaling and residual expansion methods to reduce the bit-width of the whole network weight parameters to ternary values (i.e. -1, 0, +1), with the objectives to greatly reduce model size, computation cost and accuracy degradation caused by the model compression. With about 16x model compression rate, our ternarized ResNet-32/44/56 could outperform full-precision counterparts by 0.12%, 0.24% and 0.18% on CIFAR- 10 dataset. We also test our ternarization method with AlexNet and ResNet-18 on ImageNet dataset, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
Methods1x1 Convolution · Local Response Normalization · Grouped Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Dense Connections · Max Pooling · Softmax · How do I speak to a person at Expedia?-/+/ · Convolution
