# Stochastic Sparse Learning with Momentum Adaptation for Imprecise   Memristor Networks

**Authors:** Yaoyuan Wang, Shuang Wu, Ziyang Zhang, Lei Tian, Luping Shi

arXiv: 1906.02393 · 2019-06-07

## TL;DR

This paper introduces a novel stochastic sparse learning scheme with momentum adaptation to effectively train imprecise memristor neural networks, significantly improving accuracy, reducing updates, and accelerating convergence for neuromorphic computing.

## Contribution

The paper proposes a new learning scheme combining stochastic sparse updates and momentum adaptation specifically designed for imprecise memristor networks, enhancing training robustness and efficiency.

## Key findings

- Classification accuracy improved from 26.12% to 90.07% in MLP
- Accuracy increased from 65.98% to 92.38% in CNN
- Training pulses decreased by 90% in MLP and 40% in CNN

## Abstract

Memristor based neural networks have great potentials in on-chip neuromorphic computing systems due to the fast computation and low-energy consumption. However, the imprecise properties of existing memristor devices generally result in catastrophic failures for the network in-situ training, which significantly impedes their engineering applications. In this work, we design a novel learning scheme that integrates stochastic sparse updating with momentum adaption (SSM) to efficiently train the imprecise memristor networks with high classification accuracy. The SSM scheme consists of: (1) a stochastic and discrete learning method to make weight updates sparse; (2) a momentum based gradient algorithm to eliminate training noises and distill robust updates; (3) a network re-initialization method to mitigate the device-to-device variation; (4) an update compensation strategy to further stabilize the weight programming process. With the SSM scheme, experiments show that the classification accuracy on multilayer perceptron (MLP) and convolutional neural network (CNN) improves from 26.12% to 90.07% and from 65.98% to 92.38%, respectively. Meanwhile, the total numbers of weight updating pulses decrease 90% and 40% in MLP and CNN, respectively, and the convergence rates are both 3x faster. The SSM scheme provides a high-accuracy, low-power, and fast-convergence solution for the in-situ training of imprecise memristor networks, which is crucial to future neuromorphic intelligence systems.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.02393/full.md

## Figures

14 figures with captions in the complete paper: https://tomesphere.com/paper/1906.02393/full.md

## References

26 references — full list in the complete paper: https://tomesphere.com/paper/1906.02393/full.md

---
Source: https://tomesphere.com/paper/1906.02393