VLSI Implementation of Deep Neural Network Using Integral Stochastic   Computing

Arash Ardakani; Fran\c{c}ois Leduc-Primeau; Naoya Onizawa; Takahiro; Hanyu; Warren J. Gross

arXiv:1509.08972·cs.NE·March 22, 2017

VLSI Implementation of Deep Neural Network Using Integral Stochastic Computing

Arash Ardakani, Fran\c{c}ois Leduc-Primeau, Naoya Onizawa, Takahiro, Hanyu, Warren J. Gross

PDF

TL;DR

This paper presents an integral stochastic computing approach for deep neural networks, achieving significant reductions in area, latency, and energy consumption on FPGA and CMOS implementations, enhancing efficiency and fault tolerance.

Contribution

It introduces an integer form of stochastic computation and an efficient DNN architecture, improving upon existing stochastic methods in hardware efficiency and energy savings.

Findings

01

45% reduction in area on FPGA

02

62% reduction in latency on FPGA

03

21% reduction in energy consumption in CMOS

Abstract

The hardware implementation of deep neural networks (DNNs) has recently received tremendous attention: many applications in fact require high-speed operations that suit a hardware implementation. However, numerous elements and complex interconnections are usually required, leading to a large area occupation and copious power consumption. Stochastic computing has shown promising results for low-power area-efficient hardware implementations, even though existing stochastic algorithms require long streams that cause long latencies. In this paper, we propose an integer form of stochastic computation and introduce some elementary circuits. We then propose an efficient implementation of a DNN based on integral stochastic computing. The proposed architecture has been implemented on a Virtex7 FPGA, resulting in 45% and 62% average reductions in area and latency compared to the best reported…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.