In-situ Stochastic Training of MTJ Crossbar based Neural Networks
Ankit Mondal, Ankur Srivastava

TL;DR
This paper proposes an in-situ stochastic training method for MTJ crossbar neural networks, leveraging their inherent parallelism and stochastic switching to efficiently train binary weights directly on hardware, achieving accuracy comparable to software-trained networks.
Contribution
It introduces a novel in-situ training approach utilizing stochastic MTJ switching, reducing training overhead and improving robustness to device variations.
Findings
Achieves near software-level classification accuracy.
Demonstrates robustness against manufacturing variations.
Reduces training overhead by in-situ hardware training.
Abstract
Owing to high device density, scalability and non-volatility, Magnetic Tunnel Junction-based crossbars have garnered significant interest for implementing the weights of an artificial neural network. The existence of only two stable states in MTJs implies a high overhead of obtaining optimal binary weights in software. We illustrate that the inherent parallelism in the crossbar structure makes it highly appropriate for in-situ training, wherein the network is taught directly on the hardware. It leads to significantly smaller training overhead as the training time is independent of the size of the network, while also circumventing the effects of alternate current paths in the crossbar and accounting for manufacturing variations in the device. We show how the stochastic switching characteristics of MTJs can be leveraged to perform probabilistic weight updates using the gradient descent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
