Hardware-aware Pruning of DNNs using LFSR-Generated Pseudo-Random   Indices

Foroozan Karimzadeh; Ningyuan Cao; Brian Crafton; Justin Romberg,; Arijit Raychowdhury

arXiv:1911.04468·cs.LG·November 13, 2019

Hardware-aware Pruning of DNNs using LFSR-Generated Pseudo-Random Indices

Foroozan Karimzadeh, Ningyuan Cao, Brian Crafton, Justin Romberg,, Arijit Raychowdhury

PDF

TL;DR

This paper introduces a hardware-aware DNN pruning method using LFSRs to generate non-zero weight locations in real-time, significantly reducing energy and area consumption for embedded applications.

Contribution

The novel approach leverages LFSRs for real-time, hardware-efficient pruning, reducing overhead compared to traditional sparsification techniques.

Findings

01

Energy savings up to 63.96%

02

Area reduction up to 64.23%

03

Effective on VGG-16 with ImageNet data

Abstract

Deep neural networks (DNNs) have been emerged as the state-of-the-art algorithms in broad range of applications. To reduce the memory foot-print of DNNs, in particular for embedded applications, sparsification techniques have been proposed. Unfortunately, these techniques come with a large hardware overhead. In this paper, we present a hardware-aware pruning method where the locations of non-zero weights are derived in real-time from a Linear Feedback Shift Registers (LFSRs). Using the proposed method, we demonstrate a total saving of energy and area up to 63.96% and 64.23% for VGG-16 network on down-sampled ImageNet, respectively for iso-compression-rate and iso-accuracy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsPruning