Training of Quantized Deep Neural Networks using a Magnetic Tunnel Junction-Based Synapse
Tzofnat Greenberg Toledo, Ben Perach, Itay Hubara, Daniel Soudry and, Shahar Kvatinsky

TL;DR
This paper presents a novel MTJ-based hardware synapse circuit that enables efficient in-memory training of quantized neural networks, achieving high accuracy with reduced data movement and promising energy efficiency.
Contribution
It introduces a new MTJ-based hardware synapse supporting QNN training, enabling processing near memory and demonstrating high accuracy and energy efficiency.
Findings
Achieved over 98% accuracy on MNIST with TNN.
Demonstrated in-memory training with 18.3 TOPs/W energy efficiency.
Reduced data movement by processing near memory.
Abstract
Quantized neural networks (QNNs) are being actively researched as a solution for the computational complexity and memory intensity of deep neural networks. This has sparked efforts to develop algorithms that support both inference and training with quantized weight and activation values, without sacrificing accuracy. A recent example is the GXNOR framework for stochastic training of ternary (TNN) and binary (BNN) neural networks. In this paper, we show how magnetic tunnel junction (MTJ) devices can be used to support QNN training. We introduce a novel hardware synapse circuit that uses the MTJ stochastic behavior to support the quantize update. The proposed circuit enables processing near memory (PNM) of QNN training, which subsequently reduces data movement. We simulated MTJ-based stochastic training of a TNN over the MNIST, SVHN, and CIFAR10 datasets and achieved an accuracy of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
