Flexible Network Binarization with Layer-wise Priority
Lixue Zhuang, Yi Xu, Bingbing Ni, Hongteng Xu

TL;DR
This paper introduces a flexible layer-wise priority method for neural network binarization, improving compression and performance trade-offs, and demonstrating superior pedestrian detection results compared to existing methods.
Contribution
It proposes a novel binarization approach that prioritizes layers based on their impact, allowing flexible trade-offs between accuracy and compression.
Findings
Achieves lower miss rate than state-of-the-art methods.
Faster detection speed with comparable or better accuracy.
Effective in applying to pedestrian detection benchmarks.
Abstract
How to effectively approximate real-valued parameters with binary codes plays a central role in neural network binarization. In this work, we reveal an important fact that binarizing different layers has a widely-varied effect on the compression ratio of network and the loss of performance. Based on this fact, we propose a novel and flexible neural network binarization method by introducing the concept of layer-wise priority which binarizes parameters in inverse order of their layer depth. In each training step, our method selects a specific network layer, minimizes the discrepancy between the original real-valued weights and its binary approximations, and fine-tunes the whole network accordingly. During the iteration of the above process, it is significant that we can flexibly decide whether to binarize the remaining floating layers or not and explore a trade-off between the loss of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · Video Surveillance and Tracking Methods
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
