A Highly Parallel FPGA Implementation of Sparse Neural Network Training

Sourya Dey; Diandian Chen; Zongyang Li; Souvik Kundu; Kuan-Wen Huang; Keith M. Chugg; Peter A. Beerel

arXiv:1806.01087·cs.DC·April 29, 2019

TL;DR

This paper presents a reconfigurable FPGA architecture for on-chip training and inference of sparse neural networks. Pre-determined, structured sparsity lowers memory and computational requirements, and easy reconfiguration allows more network hyperparameters to be explored on-chip.

Contribution

It introduces a highly parallel, reconfigurable FPGA design for sparse neural networks that supports on-chip training and inference with structured sparsity.

Findings

01

Implemented on-chip training and inference of sparse neural networks on an Artix-7 FPGA as proof of concept

02

Demonstrated reconfigurability to balance resource use and training speed

03

Enabled extensive hyperparameter exploration on-chip

Abstract

We demonstrate an FPGA implementation of a parallel and reconfigurable architecture for sparse neural networks, capable of on-chip training and inference. The network connectivity uses pre-determined, structured sparsity to significantly reduce complexity by lowering memory and computational requirements. The architecture uses a notion of edge-processing, leading to efficient pipelining and parallelization. Moreover, the device can be reconfigured to trade off resource utilization with training time to fit networks and datasets of varying sizes. The combined effects of complexity reduction and easy reconfigurability enable significantly greater exploration of network hyperparameters and structures on-chip. As proof of concept, we show implementation results on an Artix-7 FPGA.
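
To make the abstract's two central ideas concrete, here is a minimal Python sketch, not the authors' hardware design. It builds a pre-determined, structured connection pattern in which every output neuron has the same fixed in-degree, then estimates how many clock cycles one sweep over the edges takes when a fixed number of edges is processed in parallel. The cyclic wiring pattern, the function names, and the parallelism parameter `z` are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def structured_sparse_edges(n_in, n_out, in_degree):
    """Return a fixed list of (input, output) edges (hypothetical pattern).

    Every output neuron receives exactly `in_degree` distinct inputs.
    Inputs are assigned cyclically, so when n_out * in_degree is a
    multiple of n_in every input neuron also has the same out-degree.
    """
    if not 1 <= in_degree <= n_in:
        raise ValueError("in_degree must be between 1 and n_in")
    edges = []
    for k in range(n_out * in_degree):
        out_neuron = k // in_degree   # edges are grouped by output neuron
        in_neuron = k % n_in          # inputs are visited in cyclic order
        edges.append((in_neuron, out_neuron))
    return edges


def cycles_per_sweep(num_edges, z):
    """Cycles needed to visit all edges when z edges are handled per cycle.

    `z` is an assumed degree-of-parallelism knob: a larger z uses more
    hardware but fewer cycles, a smaller z the opposite.
    """
    if z < 1:
        raise ValueError("z must be at least 1")
    return math.ceil(num_edges / z)


# Example: a junction of 8 inputs and 4 outputs with in-degree 2.
edges = structured_sparse_edges(n_in=8, n_out=4, in_degree=2)
dense_edges = 8 * 4
density = len(edges) / dense_edges          # fraction of weights kept
in_degrees = Counter(out for _, out in edges)
out_degrees = Counter(inp for inp, _ in edges)
```

In this example the sparse junction stores 8 weights instead of the 32 a fully connected junction would need, and each weight corresponds to one edge to be processed. Calling `cycles_per_sweep(len(edges), z)` with different values of `z` shows the kind of trade-off between resource utilization and training time that the abstract describes.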

Code & Models

Repositories

souryadey/mlp-ondevice-training