E-PUR: An Energy-Efficient Processing Unit for Recurrent Neural Networks
Franyell Silfa, Gem Dot, Jose-Maria Arnau, Antonio Gonzalez

TL;DR
E-PUR is a specialized energy-efficient processing unit designed for LSTM-based RNNs, significantly reducing energy consumption and improving real-time performance on mobile devices through innovative memory locality techniques.
Contribution
The paper introduces E-PUR, a novel hardware architecture for LSTM RNNs, featuring Maximizing Weight Locality (MWL) to enhance memory efficiency and energy savings.
Findings
Achieves real-time LSTM performance on mobile devices.
Reduces energy consumption by up to 92x compared to NVIDIA Tegra X1.
Requires a very small chip area.
Abstract
Recurrent Neural Networks (RNNs) are a key technology for emerging applications such as automatic speech recognition, machine translation or image description. Long Short Term Memory (LSTM) networks are the most successful RNN implementation, as they can learn long term dependencies to achieve high accuracy. Unfortunately, the recurrent nature of LSTM networks significantly constrains the amount of parallelism and, hence, multicore CPUs and many-core GPUs exhibit poor efficiency for RNN inference. In this paper, we present E-PUR, an energy-efficient processing unit tailored to the requirements of LSTM computation. The main goal of E-PUR is to support large recurrent neural networks for low-power mobile devices. E-PUR provides an efficient hardware implementation of LSTM networks that is flexible to support diverse applications. One of its main novelties is a technique that we call…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
