Deep Neural Networks Based Weight Approximation and Computation Reuse for 2-D Image Classification
Mohammed F. Tolba, Huruy Tekle Tesfai, Hani Saleh, Baker Mohammad, and, Mahmoud Al-Qutayri

TL;DR
This paper proposes a novel method combining approximate computing and data reuse in DNNs to enhance image classification efficiency on resource-constrained devices, significantly reducing parameters and hardware requirements with minimal accuracy loss.
Contribution
It introduces a weight approximation technique during training that enables computational and data reuse, leading to substantial reductions in parameters and hardware resources for DNN inference.
Findings
Achieved 1211.3x reduction in parameters on LeNet 5 with less than 0.9% accuracy loss.
Saved 54% of adders and multipliers compared to the Row Stationary method.
Reduced memory size and access requirements, suitable for IoT edge devices.
Abstract
Deep Neural Networks (DNNs) are computationally and memory intensive, which makes their hardware implementation a challenging task especially for resource constrained devices such as IoT nodes. To address this challenge, this paper introduces a new method to improve DNNs performance by fusing approximate computing with data reuse techniques to be used for image recognition applications. DNNs weights are approximated based on the linear and quadratic approximation methods during the training phase, then, all of the weights are replaced with the linear/quadratic coefficients to execute the inference in a way where different weights could be computed using the same coefficients. This leads to a repetition of the weights across the processing element (PE) array, which in turn enables the reuse of the DNN sub-computations (computational reuse) and leverage the same data (data reuse) to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Advanced Neural Network Applications · Stochastic Gradient Optimization Techniques
