Build a Compact Binary Neural Network through Bit-level Sensitivity and Data Pruning
Yixing Li, Fengbo Ren

TL;DR
This paper introduces a method to create a more compact Binarized Neural Network by analyzing bit-level sensitivity and pruning redundant data, significantly reducing size and runtime with minimal accuracy loss.
Contribution
The work presents a novel approach combining bit-level sensitivity analysis and data pruning to shrink BNN size beyond existing methods.
Findings
Network size reduced up to 3.9x
Runtime decreased up to 2x and 9.9x
Accuracy drop limited to 1%
Abstract
Convolutional neural network (CNN) has been widely used for vision-based tasks. Due to the high computational complexity and memory storage requirement, it is hard to directly deploy a full-precision CNN on embedded devices. The hardware-friendly designs are needed for re-source-limited and energy-constrained embed-ded devices. Emerging solutions are adopted for the neural network compression, e.g., bina-ry/ternary weight network, pruned network and quantized network. Among them, Binarized Neural Network (BNN) is believed to be the most hardware-friendly framework due to its small network size and low computational com-plexity. No existing work has further shrunk the size of BNN. In this work, we explore the redun-dancy in BNN and build a compact BNN (CBNN) based on the bit-level sensitivity analy-sis and bit-level data pruning. The input data is converted to a high dimensional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · CCD and CMOS Imaging Sensors
MethodsPruning
