Exploration of Low Numeric Precision Deep Learning Inference Using Intel FPGAs
Philip Colangelo, Nasibeh Nasiri, Asit Mishra, Eriko Nurvitadhi,, Martin Margala, Kevin Nealis

TL;DR
This paper presents FPGA-based hardware designs that leverage low numeric precision, such as binary and ternary weights, to significantly improve inference speed and efficiency in CNNs while maintaining acceptable accuracy levels.
Contribution
It introduces a novel FPGA framework for low numeric precision CNN inference, demonstrating practical implementations and performance projections for various network architectures.
Findings
Achieved 3,700 images/sec on ImageNet with AlexNet at 2-bit activation and ternary weights.
Projected 55.5 TOPS performance on Stratix 10 with minimal accuracy loss on ResNet-34.
Showed that low precision inference can be efficiently mapped onto FPGA hardware.
Abstract
CNNs have been shown to maintain reasonable classification accuracy when quantized to lower precisions. Quantizing to sub 8-bit activations and weights can result in accuracy falling below an acceptable threshold. Techniques exist for closing the accuracy gap of limited numeric precision typically by increasing computation. This results in a trade-off between throughput and accuracy and can be tailored for different networks through various combinations of activation and weight data widths. Hardware architectures like FPGAs provide the opportunity for data width specific computation through unique logic configurations leading to highly optimized processing that is unattainable by full precision networks. Ternary and binary weighted networks offer an efficient method of inference for 2-bit and 1-bit data respectively. Most hardware architectures can take advantage of the memory storage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · CCD and CMOS Imaging Sensors · Neural Networks and Applications
