Training of Quantized Deep Neural Networks using a Magnetic Tunnel   Junction-Based Synapse

Tzofnat Greenberg Toledo; Ben Perach; Itay Hubara; Daniel Soudry and; Shahar Kvatinsky

arXiv:1912.12636·cs.ET·May 31, 2022

Training of Quantized Deep Neural Networks using a Magnetic Tunnel Junction-Based Synapse

Tzofnat Greenberg Toledo, Ben Perach, Itay Hubara, Daniel Soudry and, Shahar Kvatinsky

PDF

TL;DR

This paper presents a novel MTJ-based hardware synapse circuit that enables efficient in-memory training of quantized neural networks, achieving high accuracy with reduced data movement and promising energy efficiency.

Contribution

It introduces a new MTJ-based hardware synapse supporting QNN training, enabling processing near memory and demonstrating high accuracy and energy efficiency.

Findings

01

Achieved over 98% accuracy on MNIST with TNN.

02

Demonstrated in-memory training with 18.3 TOPs/W energy efficiency.

03

Reduced data movement by processing near memory.

Abstract

Quantized neural networks (QNNs) are being actively researched as a solution for the computational complexity and memory intensity of deep neural networks. This has sparked efforts to develop algorithms that support both inference and training with quantized weight and activation values, without sacrificing accuracy. A recent example is the GXNOR framework for stochastic training of ternary (TNN) and binary (BNN) neural networks. In this paper, we show how magnetic tunnel junction (MTJ) devices can be used to support QNN training. We introduce a novel hardware synapse circuit that uses the MTJ stochastic behavior to support the quantize update. The proposed circuit enables processing near memory (PNM) of QNN training, which subsequently reduces data movement. We simulated MTJ-based stochastic training of a TNN over the MNIST, SVHN, and CIFAR10 datasets and achieved an accuracy of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.