Efficient non-uniform quantizer for quantized neural network targeting reconfigurable hardware
Natan Liss, Chaim Baskin, Avi Mendelson, Alex M. Bronstein, Raja, Giryes

TL;DR
This paper introduces a hardware-efficient non-uniform quantizer for FPGA-based CNN accelerators, enabling low-power, real-time inference with minimal accuracy loss on image classification tasks.
Contribution
It proposes a novel, hardware-friendly non-uniform quantization method using a single scale integer representation for parameters and activations.
Findings
Achieved minimal accuracy degradation on CIFAR datasets.
Demonstrated suitability for real-time FPGA applications.
Outperformed uniform quantization methods in efficiency.
Abstract
Convolutional Neural Networks (CNN) has become more popular choice for various tasks such as computer vision, speech recognition and natural language processing. Thanks to their large computational capability and throughput, GPUs ,which are not power efficient and therefore does not suit low power systems such as mobile devices, are the most common platform for both training and inferencing tasks. Recent studies has shown that FPGAs can provide a good alternative to GPUs as a CNN accelerator, due to their re-configurable nature, low power and small latency. In order for FPGA-based accelerators outperform GPUs in inference task, both the parameters of the network and the activations must be quantized. While most works use uniform quantizers for both parameters and activations, it is not always the optimal one, and a non-uniform quantizer need to be considered. In this work we introduce a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · CCD and CMOS Imaging Sensors · Advanced Image and Video Retrieval Techniques
