Training LSTM Networks with Resistive Cross-Point Devices
Tayfun Gokmen, Malte Rasch, Wilfried Haensch

TL;DR
This paper demonstrates that resistive cross-point devices can accelerate training of LSTM networks, with careful consideration of device imperfections and system parameters affecting performance and accuracy.
Contribution
The work extends the RPU concept to recurrent neural networks, analyzing the impact of device imperfections and proposing solutions like stochastic rounding.
Findings
Symmetry of updates is critical for RNN training accuracy.
Input signal resolution needs to be at least 7 bits, reduced to 5 bits with stochastic rounding.
Device variations and noise can reduce overfitting, lessening the need for dropout.
Abstract
In our previous work we have shown that resistive cross point devices, so called Resistive Processing Unit (RPU) devices, can provide significant power and speed benefits when training deep fully connected networks as well as convolutional neural networks. In this work, we further extend the RPU concept for training recurrent neural networks (RNNs) namely LSTMs. We show that the mapping of recurrent layers is very similar to the mapping of fully connected layers and therefore the RPU concept can potentially provide large acceleration factors for RNNs as well. In addition, we study the effect of various device imperfections and system parameters on training performance. Symmetry of updates becomes even more crucial for RNNs; already a few percent asymmetry results in an increase in the test error compared to the ideal case trained with floating point numbers. Furthermore, the input…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
