Fully-parallel Convolutional Neural Network Hardware

Christiam F. Frasser; Pablo Linares-Serrano; V. Canals; Miquel Roca,; T. Serrano-Gotarredona; Josep L. Rossello

arXiv:2006.12439·cs.NE·June 23, 2020·1 cites

Fully-parallel Convolutional Neural Network Hardware

Christiam F. Frasser, Pablo Linares-Serrano, V. Canals, Miquel Roca,, T. Serrano-Gotarredona, Josep L. Rossello

PDF

Open Access

TL;DR

This paper introduces a power- and area-efficient stochastic computing architecture for CNNs, enabling fully-parallel CNNs like LENET-5 to be embedded in FPGA with significant improvements in speed, energy efficiency, and area reduction.

Contribution

The work presents a novel stochastic computing-based CNN architecture that addresses implementation challenges and demonstrates significant hardware efficiency improvements over traditional methods.

Findings

01

19.6x faster performance on FPGA

02

6.3x energy efficiency improvement

03

18x area reduction in VLSI implementation

Abstract

A new trans-disciplinary knowledge area, Edge Artificial Intelligence or Edge Intelligence, is beginning to receive a tremendous amount of interest from the machine learning community due to the ever increasing popularization of the Internet of Things (IoT). Unfortunately, the incorporation of AI characteristics to edge computing devices presents the drawbacks of being power and area hungry for typical machine learning techniques such as Convolutional Neural Networks (CNN). In this work, we propose a new power-and-area-efficient architecture for implementing Articial Neural Networks (ANNs) in hardware, based on the exploitation of correlation phenomenon in Stochastic Computing (SC) systems. The architecture purposed can solve the difficult implementation challenges that SC presents for CNN applications, such as the high resources used in binary-tostochastic conversion, the inaccuracy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Error Correcting Code Techniques · Neural Networks and Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings