A Targeted Acceleration and Compression Framework for Low bit Neural Networks
Biao Qian, Yang Wang

TL;DR
This paper introduces a novel framework called TAC that enhances 1-bit deep neural networks by separately optimizing convolutional and fully connected layers through binarization, pruning, and low-bit quantization, significantly improving accuracy.
Contribution
The TAC framework uniquely separates and optimizes convolutional and fully connected layers, combining binarization with pruning and quantization to boost 1-bit neural network performance.
Findings
TAC improves 1-bit DNN accuracy by over 6 percentage points.
Experimental results on CIFAR and ImageNet datasets validate effectiveness.
Outperforms existing methods in low-bit neural network accuracy.
Abstract
1 bit deep neural networks (DNNs), of which both the activations and weights are binarized , are attracting more and more attention due to their high computational efficiency and low memory requirement . However, the drawback of large accuracy dropping also restrict s its application. In this paper, we propose a novel Targeted Acceleration and Compression (TAC) framework to improve the performance of 1 bit deep neural networks W e consider that the acceleration and compression effects of binarizing fully connected layer s are not sufficient to compensate for the accuracy loss caused by it In the proposed framework, t he convolutional and fully connected layer are separated and optimized i ndividually . F or the convolutional layer s , both the activations and weights are binarized. For the fully connected layer s, the binarization operation is re placed by network pruning and low bit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
MethodsPruning
