FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with   Fractional Activations

Yichi Zhang; Junhao Pan; Xinheng Liu; Hongzheng Chen and; Deming Chen; Zhiru Zhang

arXiv:2012.12206·cs.LG·December 23, 2020

FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations

Yichi Zhang, Junhao Pan, Xinheng Liu, Hongzheng Chen and, Deming Chen, Zhiru Zhang

PDF

2 Repos

TL;DR

FracBNN introduces fractional activations and a novel input encoding to significantly improve the accuracy of binary neural networks on FPGA hardware, enabling real-time image classification with reduced model size.

Contribution

The paper proposes a new BNN architecture with fractional activations and a thermometer encoding input scheme, achieving higher accuracy and FPGA efficiency.

Findings

01

28.9% increase in top-1 accuracy over previous BNNs on ImageNet

02

2.5x reduction in model size compared to prior models

03

Real-time image classification on embedded FPGA

Abstract

Binary neural networks (BNNs) have 1-bit weights and activations. Such networks are well suited for FPGAs, as their dominant computations are bitwise arithmetic and the memory requirement is also significantly reduced. However, compared to start-of-the-art compact convolutional neural network (CNN) models, BNNs tend to produce a much lower accuracy on realistic datasets such as ImageNet. In addition, the input layer of BNNs has gradually become a major compute bottleneck, because it is conventionally excluded from binarization to avoid a large accuracy loss. This work proposes FracBNN, which exploits fractional activations to substantially improve the accuracy of BNNs. Specifically, our approach employs a dual-precision activation scheme to compute features with up to two bits, using an additional sparse binary convolution. We further binarize the input layer using a novel thermometer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsPointwise Convolution · Depthwise Convolution · Depthwise Separable Convolution · Batch Normalization · Average Pooling · Inverted Residual Block · Convolution · 1x1 Convolution · Tether Customer Service Number +1-833-534-1729