Deep Neural Networks Based Weight Approximation and Computation Reuse   for 2-D Image Classification

Mohammed F. Tolba; Huruy Tekle Tesfai; Hani Saleh; Baker Mohammad; and; Mahmoud Al-Qutayri

arXiv:2105.02954·cs.CV·May 10, 2021

Deep Neural Networks Based Weight Approximation and Computation Reuse for 2-D Image Classification

Mohammed F. Tolba, Huruy Tekle Tesfai, Hani Saleh, Baker Mohammad, and, Mahmoud Al-Qutayri

PDF

Open Access

TL;DR

This paper proposes a novel method combining approximate computing and data reuse in DNNs to enhance image classification efficiency on resource-constrained devices, significantly reducing parameters and hardware requirements with minimal accuracy loss.

Contribution

It introduces a weight approximation technique during training that enables computational and data reuse, leading to substantial reductions in parameters and hardware resources for DNN inference.

Findings

01

Achieved 1211.3x reduction in parameters on LeNet 5 with less than 0.9% accuracy loss.

02

Saved 54% of adders and multipliers compared to the Row Stationary method.

03

Reduced memory size and access requirements, suitable for IoT edge devices.

Abstract

Deep Neural Networks (DNNs) are computationally and memory intensive, which makes their hardware implementation a challenging task especially for resource constrained devices such as IoT nodes. To address this challenge, this paper introduces a new method to improve DNNs performance by fusing approximate computing with data reuse techniques to be used for image recognition applications. DNNs weights are approximated based on the linear and quadratic approximation methods during the training phase, then, all of the weights are replaced with the linear/quadratic coefficients to execute the inference in a way where different weights could be computed using the same coefficients. This leads to a repetition of the weights across the processing element (PE) array, which in turn enables the reuse of the DNN sub-computations (computational reuse) and leverage the same data (data reuse) to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Advanced Neural Network Applications · Stochastic Gradient Optimization Techniques