Field-Programmable Deep Neural Network (DNN) Learning and Inference   accelerator: a concept

Luiz M Franca-Neto

arXiv:1802.04899·cs.LG·March 26, 2018·1 cites

Field-Programmable Deep Neural Network (DNN) Learning and Inference accelerator: a concept

Luiz M Franca-Neto

PDF

Open Access

TL;DR

This paper proposes a reconfigurable, pipelined FPGA-based accelerator for DNNs that significantly speeds up learning and inference by optimizing resource allocation per layer, achieving over 50x speedup over GPUs.

Contribution

It introduces a novel reconfigurable architecture combining hybrid systolic techniques and deep pipelining for flexible DNN acceleration.

Findings

01

Achieves over 50x speedup compared to GPUs and TPUs.

02

Demonstrates flexibility with VGG-16 and Inception modules.

03

Validates design through behavioral-functional simulation.

Abstract

An accelerator is a specialized integrated circuit designed to perform specific computations faster than if those were performed by CPU or GPU. A Field-Programmable DNN learning and inference accelerator (FProg-DNN) using hybrid systolic and non-systolic techniques, distributed information-control and deep pipelined structure is proposed and its microarchitecture and operation presented here. Reconfigurability attends diverse DNN designs and allows for different number of workers to be assigned to different layers as a function of the relative difference in computational load among layers. The computational delay per layer is made roughly the same along pipelined accelerator structure. VGG-16 and recently proposed Inception Modules are used for showing the flexibility of the FProg-DNN reconfigurability. Special structures were also added for a combination of convolution layer, map…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Neural Networks and Applications · CCD and CMOS Imaging Sensors

MethodsConvolution