TL;DR
FracBNN introduces fractional activations and a novel input encoding to significantly improve the accuracy of binary neural networks on FPGA hardware, enabling real-time image classification with reduced model size.
Contribution
The paper proposes a new BNN architecture with fractional activations and a thermometer encoding input scheme, achieving higher accuracy and FPGA efficiency.
Findings
28.9% increase in top-1 accuracy over previous BNNs on ImageNet
2.5x reduction in model size compared to prior models
Real-time image classification on embedded FPGA
Abstract
Binary neural networks (BNNs) have 1-bit weights and activations. Such networks are well suited for FPGAs, as their dominant computations are bitwise arithmetic and the memory requirement is also significantly reduced. However, compared to start-of-the-art compact convolutional neural network (CNN) models, BNNs tend to produce a much lower accuracy on realistic datasets such as ImageNet. In addition, the input layer of BNNs has gradually become a major compute bottleneck, because it is conventionally excluded from binarization to avoid a large accuracy loss. This work proposes FracBNN, which exploits fractional activations to substantially improve the accuracy of BNNs. Specifically, our approach employs a dual-precision activation scheme to compute features with up to two bits, using an additional sparse binary convolution. We further binarize the input layer using a novel thermometer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPointwise Convolution · Depthwise Convolution · Depthwise Separable Convolution · Batch Normalization · Average Pooling · Inverted Residual Block · Convolution · 1x1 Convolution · Tether Customer Service Number +1-833-534-1729
