Boosting LSTM Performance Through Dynamic Precision Selection
Franyell Silfa, Jose-Maria Arnau, Antonio Gonz\`alez

TL;DR
This paper introduces a hardware scheme for LSTM inference that dynamically adjusts numerical precision based on cell state changes, leading to significant speed and energy efficiency improvements without accuracy loss.
Contribution
It proposes a novel hardware approach for dynamic precision selection in LSTMs, improving efficiency over fixed-precision methods.
Findings
Over 66% of time steps use lower precision
Achieves 1.56x speedup and 23% energy savings
No accuracy loss observed
Abstract
The use of low numerical precision is a fundamental optimization included in modern accelerators for Deep Neural Networks (DNNs). The number of bits of the numerical representation is set to the minimum precision that is able to retain accuracy based on an offline profiling, and it is kept constant for DNN inference. In this work, we explore the use of dynamic precision selection during DNN inference. We focus on Long Short Term Memory (LSTM) networks, which represent the state-of-the-art networks for applications such as machine translation and speech recognition. Unlike conventional DNNs, LSTM networks remember information from previous evaluations by storing data in the LSTM cell state. Our key observation is that the cell state determines the amount of precision required: time steps where the cell state changes significantly require higher precision, whereas time steps where the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
