TL;DR
This paper presents a highly efficient combinational circuit approach for implementing activation functions like SELU and tanh in neural network accelerators, achieving significant area and speed improvements with minimal accuracy loss.
Contribution
It introduces a novel combinational circuit design methodology for activation functions, optimizing hardware efficiency and demonstrating negligible impact on neural network accuracy.
Findings
Circuit designs for tanh and SELU save significant area compared to LUT-based implementations.
The proposed circuits operate at high frequencies (over 4 GHz).
Neural networks show insensitivity to activation function precision.
Abstract
The widespread application of artificial neural networks has prompted researchers to experiment with FPGA and customized ASIC designs to speed up their computation. These implementation efforts have generally focused on weight multiplication and signal summation operations, and less on activation functions used in these applications. Yet, efficient hardware implementations of nonlinear activation functions like Exponential Linear Units (ELU), Scaled Exponential Linear Units (SELU), and Hyperbolic Tangent (tanh), are central to designing effective neural network accelerators, since these functions require lots of resources. In this paper, we explore efficient hardware implementations of activation functions using purely combinational circuits, with a focus on two widely used nonlinear activation functions, i.e., SELU and tanh. Our experiments demonstrate that neural networks are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
