Efficient Hardware Realizations of Feedforward Artificial Neural Networks
Mohammadreza Esmali Nojehdeh, Sajjad Parvin, Mustafa Altun

TL;DR
This paper introduces efficient hardware design techniques for feedforward neural networks, including weight quantization, complexity reduction algorithms, and a CAD tool, leading to lower area, energy, and latency.
Contribution
It proposes novel methods for quantizing weights, tuning them to reduce hardware complexity, and implementing multiplications without multipliers, along with an automated design tool.
Findings
Quantization reduces hardware complexity significantly.
Multiplierless implementation decreases area and energy consumption.
Tuning algorithms maintain accuracy while simplifying hardware.
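To make the multiplierless finding concrete, here is a minimal sketch of multiplying an input by a constant integer weight using only shifts, additions, and subtractions. It uses the canonical signed digit (CSD) recoding, a standard textbook technique chosen here for illustration; it is not necessarily the algorithm the paper uses to minimize the number of operations. The function names `csd_digits`, `shift_add_multiply`, and `adder_count` are hypothetical.

```python
def csd_digits(c):
    """Signed digits (-1, 0, +1) of a non-negative integer c, least significant first.

    In this recoding no two adjacent digits are both nonzero.
    """
    digits = []
    while c != 0:
        if c & 1:
            d = 2 - (c & 3)  # +1 when c % 4 == 1, -1 when c % 4 == 3
            c -= d
        else:
            d = 0
        digits.append(d)
        c >>= 1
    return digits


def shift_add_multiply(c, x):
    """Compute c * x with shifts and adds/subtracts only (illustrative, not the paper's method)."""
    acc = 0
    for shift, d in enumerate(csd_digits(abs(c))):
        if d == 1:
            acc += x << shift
        elif d == -1:
            acc -= x << shift
    return -acc if c < 0 else acc


def adder_count(c):
    """Number of add/subtract operations this sketch spends on the constant c."""
    nonzero = sum(1 for d in csd_digits(abs(c)) if d != 0)
    return max(nonzero - 1, 0)
```

For example, the constant 7 is recoded as 8 - 1, so 7x becomes (x << 3) - x, which needs one subtraction instead of a multiplier. This sketch treats each constant independently; sharing partial terms across several constants can reduce the operation count further.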
Abstract
This article presents design techniques proposed for efficient hardware implementation of feedforward artificial neural networks (ANNs) under parallel and time-multiplexed architectures. To reduce their design complexity, after the weights of the ANN are determined in a training phase, we introduce a technique to find the minimum quantization value used to convert the floating-point weight values to integers. For each design architecture, we also propose an algorithm that tunes the integer weights to reduce the hardware complexity without a loss in hardware accuracy. Furthermore, the multiplications of constant weights by input variables are implemented under the shift-adds architecture using the fewest addition/subtraction operations found by prominent previously proposed algorithms. Finally, we introduce a computer-aided design (CAD) tool, called SIMURG, that can describe…
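The search for a minimum quantization value can be sketched as follows. This is an illustration under stated assumptions, not the procedure from the paper: it assumes weights are scaled by 2^q and rounded, and that the smallest q meeting an accuracy target is wanted. The names `quantize`, `min_quantization_value`, `toy_accuracy`, and the toy data are all hypothetical.

```python
def quantize(weights, q):
    """Scale each floating-point weight by 2**q and round to the nearest integer."""
    return [round(w * (1 << q)) for w in weights]


def min_quantization_value(weights, accuracy_fn, target, max_q=16):
    """Return the smallest q whose integer weights reach `target` accuracy, else None."""
    for q in range(max_q + 1):
        if accuracy_fn(quantize(weights, q)) >= target:
            return q
    return None


# Toy single-neuron classifier: predict 1 when the weighted sum is positive.
# The weights and samples below are made up for illustration only.
float_weights = [0.3, -0.2]
samples = [((1, 1), 1), ((1, 2), 0), ((2, 1), 1), ((1, 3), 0)]


def toy_accuracy(int_weights):
    """Fraction of toy samples classified correctly with the given integer weights."""
    correct = 0
    for inputs, label in samples:
        s = sum(w * x for w, x in zip(int_weights, inputs))
        correct += int((1 if s > 0 else 0) == label)
    return correct / len(samples)


best_q = min_quantization_value(float_weights, toy_accuracy, target=1.0)
```

On this toy data, q = 4 is the first value that classifies every sample correctly, giving integer weights [5, -3]. A smaller q yields smaller integers and hence cheaper hardware, which is why the search stops at the first q that meets the target.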
Taxonomy
Topics: Neural Networks and Applications · Advanced Neural Network Applications · Model Reduction and Neural Networks
