Boosting LSTM Performance Through Dynamic Precision Selection

Franyell Silfa; Jose-Maria Arnau; Antonio Gonz\`alez

arXiv:1911.04244·eess.SP·November 12, 2019

Boosting LSTM Performance Through Dynamic Precision Selection

Franyell Silfa, Jose-Maria Arnau, Antonio Gonz\`alez

PDF

TL;DR

This paper introduces a hardware scheme for LSTM inference that dynamically adjusts numerical precision based on cell state changes, leading to significant speed and energy efficiency improvements without accuracy loss.

Contribution

It proposes a novel hardware approach for dynamic precision selection in LSTMs, improving efficiency over fixed-precision methods.

Findings

01

Over 66% of time steps use lower precision

02

Achieves 1.56x speedup and 23% energy savings

03

No accuracy loss observed

Abstract

The use of low numerical precision is a fundamental optimization included in modern accelerators for Deep Neural Networks (DNNs). The number of bits of the numerical representation is set to the minimum precision that is able to retain accuracy based on an offline profiling, and it is kept constant for DNN inference. In this work, we explore the use of dynamic precision selection during DNN inference. We focus on Long Short Term Memory (LSTM) networks, which represent the state-of-the-art networks for applications such as machine translation and speech recognition. Unlike conventional DNNs, LSTM networks remember information from previous evaluations by storing data in the LSTM cell state. Our key observation is that the cell state determines the amount of precision required: time steps where the cell state changes significantly require higher precision, whereas time steps where the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory