Fully-parallel Convolutional Neural Network Hardware
Christiam F. Frasser, Pablo Linares-Serrano, V. Canals, Miquel Roca,, T. Serrano-Gotarredona, Josep L. Rossello

TL;DR
This paper introduces a power- and area-efficient stochastic computing architecture for CNNs, enabling fully-parallel CNNs like LENET-5 to be embedded in FPGA with significant improvements in speed, energy efficiency, and area reduction.
Contribution
The work presents a novel stochastic computing-based CNN architecture that addresses implementation challenges and demonstrates significant hardware efficiency improvements over traditional methods.
Findings
19.6x faster performance on FPGA
6.3x energy efficiency improvement
18x area reduction in VLSI implementation
Abstract
A new trans-disciplinary knowledge area, Edge Artificial Intelligence or Edge Intelligence, is beginning to receive a tremendous amount of interest from the machine learning community due to the ever increasing popularization of the Internet of Things (IoT). Unfortunately, the incorporation of AI characteristics to edge computing devices presents the drawbacks of being power and area hungry for typical machine learning techniques such as Convolutional Neural Networks (CNN). In this work, we propose a new power-and-area-efficient architecture for implementing Articial Neural Networks (ANNs) in hardware, based on the exploitation of correlation phenomenon in Stochastic Computing (SC) systems. The architecture purposed can solve the difficult implementation challenges that SC presents for CNN applications, such as the high resources used in binary-tostochastic conversion, the inaccuracy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Error Correcting Code Techniques · Neural Networks and Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
